Abstract. We review the non-trivial issue of the relativistic description of
a quantum mechanical system that, contrary to a common belief, kept theoreticians
busy from the end of the 1920s to (at least) the mid 1940s. Starting from the
well-known works by Klein-Gordon and Dirac, we then give an account of the
main results achieved by a variety of authors, ranging from de Broglie
to Proca, Majorana, Fierz-Pauli, Kemmer, Rarita-Schwinger and many others.

Particular interest attaches to the general problem of the description
of particles with arbitrary spin, introduced (and solved) by Majorana as early
as 1932, and later reconsidered, within a different approach, by Dirac in 1936
and by Fierz-Pauli in 1939. The final settlement of the problem in 1945 by
Bhabha, who came back to the general ideas introduced by Majorana in 1932,
is discussed as well and, drawing also on unpublished documents by Majorana,
we are able to reconstruct the line of reasoning behind the Majorana and the
Bhabha equations, as well as its evolution. Intriguingly enough, such an
evolution was identical in the two authors, the difference being just the
time required for it: probably a few weeks in one case (Majorana), and more
than ten years in the other (Bhabha), with the contribution of several
intermediate authors.

The important unpublished contributions by Majorana anticipated later
results obtained, in a more involved way, by de Broglie (1934) and by Duffin
and Kemmer (1938-9), and testify to the intermediate steps in the line of
reasoning that led to the paper published in 1932 by Majorana, while Bhabha
benefited from the corresponding (later) published literature. Majorana's paper
of 1932, in fact, contrary to the more complicated Dirac-Fierz-Pauli formalism,
proved very difficult to understand fully (probably because of its pregnant
meaning and latent physical and mathematical content): as is clear from his
letters, even Pauli (who suggested its reading to Bhabha) took about one year,
in 1940-1, to understand it. This testifies to the difficulty of the problem,
and to the depth of Majorana's reasoning and results.

The relevance of the issue reviewed here for present-day research is outlined
as well.



1. Introduction 

The birth of quantum mechanics was driven by the basic principles of the theory of
relativity. Indeed, in 1923 L. de Broglie was the first [1] to exploit Lorentz invariance
in order to formulate the well-known relations between the energy/momentum of a
particle and the frequency/wave-vector of the associated wave. According to P.A.M.
Dirac [2], even the subsequent formal development of de Broglie's ideas did not lead
E. Schrödinger [3] to write down first his most famous (non-relativistic) equation
but, rather, the relativistic wave equation now named after O. Klein and W. Gordon [4]
(which, for some time, was referred to as the relativistic Schrödinger equation).
The original reasoning by Schrödinger started from the relativistic energy-momentum
relation for an electron,

$$ -\frac{W^2}{c^2} + p^2 + m^2c^2 = 0, \tag{1} $$

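and then assumed that the electron would be described by a wave-function $\psi(\mathbf{x}, t)$
satisfying the equation obtained by making in (1) the replacements

$$ \frac{W}{c} \to \frac{i\hbar}{c}\frac{\partial}{\partial t}, \qquad \mathbf{p} \to -i\hbar\nabla, \tag{2} $$

from which de Broglie's relations follow for a plane wave $e^{i(\mathbf{k}\cdot\mathbf{x} - \omega t)}$. The
resulting wave equation, the first to be discovered, was then the relativistic Klein-Gordon
equation:

$$ \left( \frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \frac{m^2c^2}{\hbar^2} \right)\psi = 0. \tag{3} $$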
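A short worked check may help at this point. It is only a sketch, assuming the plane-wave form quoted above, with wave-vector $\mathbf{k}$ and angular frequency $\omega$ as illustrative symbols, and the standard form of the Klein-Gordon operator:

```latex
\begin{align*}
\psi(\mathbf{x},t) &= e^{i(\mathbf{k}\cdot\mathbf{x}-\omega t)} \\
\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2}-\nabla^2+\frac{m^2c^2}{\hbar^2}\right)\psi
  &= \left(-\frac{\omega^2}{c^2}+k^2+\frac{m^2c^2}{\hbar^2}\right)\psi = 0 \\
\hbar^2\omega^2 &= c^2\hbar^2k^2+m^2c^4 \\
W = \hbar\omega &= \pm\sqrt{c^2p^2+m^2c^4}, \qquad \mathbf{p}=\hbar\mathbf{k}
\end{align*}
```

The de Broglie relations $W = \hbar\omega$, $\mathbf{p} = \hbar\mathbf{k}$ thus turn the wave equation back into the energy-momentum relation, and both signs of $W$ appear because the equation is of second order in time.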



Schrödinger then abandoned his relativistic wave equation, since it gave wrong
predictions for the fine structure of the hydrogen atom, but later realized that its
non-relativistic approximation, the Schrödinger equation proper, led to some correct
results despite the shortcomings of the original relativistic formulation.
Schrödinger and others soon recognized that the source of the discrepancy between
the relativistic wave equation and observations was the neglect of the spin of the
electron (the Klein-Gordon equation describes spin-0 particles) but, as is well known,
one had to wait until 1928, when Dirac discovered [5] how to incorporate the spin of
the electron in wave mechanics in a consistent, relativistically invariant manner.

The enormous success of the Dirac theory after the discovery of the positron in
1932 [5], and especially its quantum field theory formulation and incorporation in
quantum electrodynamics, almost entirely obscured other subsequent formulations
and generalizations of relativistic wave equations for particles with different values
of the spin. Indeed, the early history of such equations is quite rich and intriguing,
and extends over almost two decades. In the present paper we give (in the following
section) a historical account of the different formulations of relativistic wave
equations that appeared in the literature up to the end of the 1940s, together with
the physical motivations for them. Subsequent elaborations were mainly aimed at
achieving mathematical improvements, and will not be considered here. It must be
recalled that relativistic wave mechanics derives its physical relevance precisely
from its incorporation into quantum field theory (sometimes referred to as "second
quantization"): as was later realized, indeed, a relativistic quantum theory of a
fixed number of particles is an impossibility. While some mention of the quantum
field theory justification of the equations proposed will be made occasionally, given
its relevance for the subsequent developments, we will not dwell upon this topic, which
is far beyond our aim (for several aspects, see, instead, Ref. [?]). We will refer
chiefly to the original papers (primary sources), since no exhaustive historical
reviews are known; see however the partial but beautiful, though dated, review in
Ref. [8] (see also [9]). While such papers tell us the known, though not widespread,
story of this subject, in Sect. 3 we focus on several important results achieved by
the Italian physicist Ettore Majorana [10] [11] but not published by him. They are
contained in some booklets with his personal research notes, which have been published
only recently [12] [13], and clearly testify (once more [14]) to how he anticipated
key results obtained by different authors over the subsequent years. In this
respect, it is also interesting to point out how the results published by Majorana
were received by his contemporaries, since they were fully understood only years
later (a first illuminating example being just the mentioned Ref. [9]). From the
correspondence of W. Pauli, then, we will be able in Sect. 4 to follow directly the
study of the seminal paper published by Majorana in 1932 [15], which occupied Pauli
for about one year. Finally, in Sect. 5 we summarize the results reviewed and give
our conclusions.



2. Relativistic wave equations: the known story

The original rejection by Schrödinger (and others) of Eq. (3) as the correct quantum
equation describing an electron was based, as recalled above, on the comparison of
its predictions with the accurate spectroscopic data on the hydrogen atom, and the
discrepancy was correctly attributed to the neglect of the spin of the electron.
However, when Dirac approached the problem of building a relativistic theory of the
spinning electron, he made no reference to such discrepancy, but rather focused on
another theoretical problem, namely that of negative probabilities.

2.1. Dirac equation for a spin-1/2 electron (1928). The probability density
for the non-relativistic Schrödinger equation was known [16] to be a positive definite
quantity, $\rho = |\psi|^2$, satisfying a continuity equation which makes the space
integral of $\rho$ time-independent. On the other hand, the only $\rho$ which can be
formed from the solutions of the Klein-Gordon equation and again satisfies a continuity
equation does not have a definite sign, so that it cannot be identified with
the probability density in the corresponding relativistic case. This was the major
problem which Dirac attempted to solve in order to obtain a consistent relativistic
theory. He then realized that the possible negative probabilities arising from the
Klein-Gordon equation were due to the presence, contrary to what happened in the
non-relativistic case, of a time derivative of the wave function in the expression for
$\rho$, and that this followed directly from the fact that the Klein-Gordon equation is
a differential equation of the second order in the time variable. The dynamical
evolution of the wave function should then be ruled by a differential equation of the
first order in time, just as for the Schrödinger equation. In order to have a
Lorentz-invariant theory, however, this led Dirac to assume that the relativistic wave
equation to be found should be linear in the space derivatives as well:

$$ \left( \frac{W}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} + \alpha_4 mc \right)\psi = 0, \tag{4} $$

where the replacements in (2) apply. "The α's are new dynamical variables which
it is necessary to introduce in order to satisfy the conditions of the problem. They
may be regarded as describing some internal motion of the electron, which for most
purposes may be taken to be the spin of the electron postulated in previous theories." [5]
The "condition" to be satisfied is that the energy and momentum of the particle
obey the energy-momentum relation in Eq. (1) or, through (2), that the solution
of the Dirac equation (4) also satisfies the Klein-Gordon equation (3). Dirac showed
that this is indeed the case provided that the "new dynamical variables" satisfy
the relations:

$$ \alpha_\mu^2 = 1, \qquad \alpha_\mu\alpha_\nu + \alpha_\nu\alpha_\mu = 0 \tag{5} $$

($\mu \neq \nu$; $\mu, \nu = 1, 2, 3, 4$). He noted that, for these relations to be true, the $\alpha$
quantities must be 4 × 4 matrices obtained from the Pauli 2 × 2 matrices $\sigma_i$,

$$ \alpha_i = \begin{pmatrix} 0 & \sigma_i \\ \sigma_i & 0 \end{pmatrix} \quad (i = 1, 2, 3), \qquad \alpha_4 = \begin{pmatrix} I_2 & 0 \\ 0 & -I_2 \end{pmatrix} \tag{6} $$

(where 0 and $I_2$ are the 2 × 2 zero and identity matrices, respectively), so that the
wave function itself is what will later be named a 4-component spinor. Dirac then
proved the full Lorentz invariance of his theory: Eq. (4) could be written in the
form

$$ \left( i\gamma_\mu p_\mu + mc \right)\psi = 0, \tag{7} $$
with the matrices $\gamma_\mu$ transforming as a 4-vector (just as the 4-momentum $p_\mu$), and
satisfying the relation:

$$ \gamma_\mu\gamma_\nu + \gamma_\nu\gamma_\mu = 2\delta_{\mu\nu} \tag{8} $$
(a relation which is relativistically invariant as well). Moreover, he obtained the
Hamiltonian for an electron in an arbitrary electromagnetic field and, from the
conservation of angular momentum, showed that "his" electron actually had a value
of the spin equal to 1/2. Finally, he also pointed out that his theory gave correct
predictions (at order $\alpha^4 mc^2$, where $\alpha$ is the fine structure constant) for the fine
structure splitting of the hydrogen energy levels, contrary to what happened for
the Klein-Gordon equation. However, as already pointed out, Dirac's primary aim
was a relativistic formalism with positive probabilities, and this was actually
achieved, since the probability density satisfying the continuity equation was now
just given by $\rho = |\psi|^2$.
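The algebraic statements of this subsection can be checked numerically. The following is a minimal sketch in plain Python, not Dirac's own procedure: it assumes the standard block form of the four α matrices built from the Pauli matrices, and one common choice of γ matrices ($\gamma_4 = \alpha_4$, $\gamma_r = -i\alpha_4\alpha_r$), which is an assumption made here for illustration since other equivalent choices exist. All helper names are hypothetical. The check confirms that each set squares to the identity and that distinct members anticommute.

```python
# Pauli matrices and 2x2 building blocks (plain Python lists, complex entries)
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def block(a, b, c, d):
    """Assemble the 4x4 matrix [[a, b], [c, d]] from four 2x2 blocks."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]


def matmul(x, y):
    """Product of two square matrices of the same size."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(s, x):
    """Multiply every entry of the matrix x by the scalar s."""
    return [[s * v for v in row] for row in x]


def anticommutator(x, y):
    """Return x y + y x."""
    xy, yx = matmul(x, y), matmul(y, x)
    return [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(xy, yx)]


def satisfies_dirac_algebra(mats, tol=1e-12):
    """True if M_mu M_nu + M_nu M_mu = 2 delta_{mu nu} for every pair."""
    n = len(mats[0])
    for mu, a in enumerate(mats):
        for nu, b in enumerate(mats):
            ac = anticommutator(a, b)
            for i in range(n):
                for j in range(n):
                    expected = 2 if (mu == nu and i == j) else 0
                    if abs(ac[i][j] - expected) > tol:
                        return False
    return True


# alpha_1..alpha_3 are off-diagonal in the Pauli matrices, alpha_4 = diag(I2, -I2)
ALPHA = [block(Z2, s, s, Z2) for s in SIGMA] + [block(I2, Z2, Z2, scale(-1, I2))]

# Assumed choice of gamma matrices: gamma_r = -i alpha_4 alpha_r, gamma_4 = alpha_4
GAMMA = [scale(-1j, matmul(ALPHA[3], ALPHA[r])) for r in range(3)] + [ALPHA[3]]

print(satisfies_dirac_algebra(ALPHA), satisfies_dirac_algebra(GAMMA))
```

The same function returns False for a list containing the same matrix twice, since that pair does not anticommute; this is a quick way to see that the relations constrain the matrices and are not satisfied trivially.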

2.2. Negative energy states. A second major difficulty of the Klein-Gordon
equation, as pointed out by Dirac himself, again concerned the fact that the energy
W in Eq. (1) appears to the second power. As a matter of fact, the Klein-Gordon
equation...

...refers equally well to an electron with charge e as to one with
charge −e. If one considers for definiteness the limiting case of
large quantum numbers one would find that some of the solutions
of the wave equation are wave packets moving in the way a particle
of charge −e would move on the classical theory, while others are
wave packets moving in the way a particle of charge e would move
classically. For this second class of solutions W has a negative value.
One gets over the difficulty on the classical theory by arbitrarily
excluding those solutions that have a negative W. One cannot do
this on the quantum theory, since in general a perturbation will
cause transitions from states with W positive to states with W
negative. Such a transition would appear experimentally as the
electron suddenly changing its charge from −e to e, a phenomenon
which has not been observed [5].
This second difficulty was not removed in the original Dirac theory, as explicitly
admitted in [5], and the subsequent three years saw Dirac working to solve this
problem. Indeed, the quotation above should not be read - anachronistically - as
an anticipation of the final interpretation of the negative energy states in terms of
positrons, which came through a tortuous path only some years later and culminated
in the experimental observations by C.D. Anderson in 1932 [6].

W. Heisenberg and Pauli, for example, simply set the problem aside, regarding
the negative energy states as an "inconsistency of the theory [...], which must
be accepted as long as the Dirac difficulty is unexplained" [17]. Dirac, instead, soon
realized that it follows from his equation that a negative energy electron moves in
an external field as if it had positive charge and energy. This original observation
led H. Weyl in 1929 to "expect that, among the two pairs of components of the
Dirac [wavefunction] one pair corresponds to the electron, while the other to the
proton" [18]. This association was further considered (for some time) by Dirac
in the framework of his "hole theory", with the identification of "the holes in the
distribution of negative energy electrons" with the protons [19], although novel
difficulties arose with such an interpretation. Here we only quote from the paper by
Dirac that appeared at the very beginning of 1930:

Can the present theory account for the great dissymmetry between
electrons and protons, which manifests itself through their different
masses and the power of protons to combine to form heavier
atomic nuclei? It is evident that the theory gives, to a large extent,
symmetry between electrons and protons [...] The symmetry
is not, however, mathematically perfect when one takes interaction
between the electrons into account. [...] The consequences of
this dissymmetry are not very easy to calculate on relativistic lines,
but we may hope it will lead eventually to an explanation of the
different masses of proton and electron [19].

Such a difficulty in the interpretation of the theory, and some others related to it
(namely, the excessively high rate of annihilation of electrons and protons), were later
overcome by Dirac himself, after Weyl's proof that holes necessarily represent particles
with the same mass as an electron [20] [21]. This novel interpretation, though
highly controversial, was eventually confirmed by the cosmic-ray experiments by
Anderson [6] in 1932. However, the fact that the observed positive electrons were
indeed Dirac antiparticles was not clear for some time, and was fully recognized
only after the appearance of the experimental results on cosmic-ray showers by
P.M.S. Blackett and G.P.S. Occhialini [22], followed by a discussion of their results
within the framework of the Dirac theory. Notwithstanding this, Pauli in particular
remained very critical of the Dirac theory of negative energy states,^1 a position
shared by another theoretician who came into play with a more general (and more
difficult to understand) theory.

2.3. Majorana infinite-component equation (1932). With the exception of
Pauli and very few other people who gave some (justified) importance to the
theoretical problem of the negative energy states, the vast majority of physicists
took the success of the Dirac equation for granted even before its "final" confirmation
with the discovery of the positron. The reason was mainly the successful predictions
about the fine structure of hydrogen and the correct account of the magnetic moment
of the electron (with the prediction of its gyromagnetic ratio g = 2), which
had been a major trouble for the newly born quantum mechanics. People realized that
such problems were intimately related to the spin of the electron, and the successful
Dirac theory offered a reasonable and attractive way out, well founded, moreover, on
a solid formal basis. Even more, a quite general consensus prevailed that the value 1/2
of the electron spin was a necessary consequence of the theory of relativity, i.e. of
Lorentz invariance [24]. Just the contrary was demonstrated in 1932 by Majorana
[15], who explicitly built a consistent relativistic quantum theory for particles with
arbitrary spin.

^1 "I do not believe in your perception of 'holes', even if the existence of the
'antielectron' is proved", wrote Pauli to Dirac on May 1, 1933 [23].

The starting point of Majorana's paper, however, was again the persisting
problem of the negative energy states, a lucid examination of which directly led
Majorana to generalize the Dirac equation to particles with arbitrary spin. The
relativistic wave equation was assumed to be of the same form as in Eq. (4), i.e.

$$ \left[ \frac{W}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} - \beta mc \right]\psi = 0, \tag{9} $$

but nothing was assumed about the $\alpha$, $\beta$ matrices appearing in it. Instead, a thorough
(mathematical) inspection of the origin of the negative energy states was
presented.

Equations of this kind present a difficulty in principle. Indeed, the
operator $\beta$ has to transform as the time component of a 4-vector,
and thus $\beta$ cannot be simply a multiple of the unit matrix, but must
have at least two different eigenvalues, say $\beta_1$ and $\beta_2$. However, this
implies that the energy of the particle at rest, obtained from Eq. (9)
by taking p = 0, shall have at least two different values, i.e. $\beta_1 mc^2$
and $\beta_2 mc^2$. According to Dirac's equations, the allowed values of
the mass at rest are, as well known, +m and −m; from this it
follows by relativistic invariance that for each value of p the energy
can acquire two values differing in sign: $W = \pm\sqrt{m^2c^4 + c^2p^2}$. As
a matter of fact, the indeterminacy in the sign of the energy can
be eliminated by using equations of the type (9), only if the wave
function has infinitely many components that cannot be split into
finite tensors or spinors [15].

Thus, Majorana did not require the energy-momentum relation (1) to be satisfied
for each component of $\psi$, as Dirac instead did in order to determine the
expressions for the matrices $\alpha$, $\beta$. Rather, Majorana determined a priori a
representation for the infinite matrices corresponding to the six infinitesimal Lorentz
transformations, that is, the transformation matrices for the components of the wave
function forming a basis for that representation. The form of the $\alpha$, $\beta$ matrices was
then deduced just by imposing relativistic invariance on the action integral. The
rest energy of the particles described by such a theory is, as anticipated, positive:

$$ W_0 = \frac{mc^2}{j + 1/2}, \tag{10} $$

but this reveals that Eq. (9) is a multi-mass equation. "For half-integer values of
j we thus obtain states corresponding to the values m, m/2, m/3, ... of the mass,
while for integer j one has 2m, 2m/3, 2m/5, .... It should be emphasized that
particles having different masses also have different intrinsic angular momentum."
Solutions with other kinds of energy eigenvalues are present in the theory as well:
"Besides the states pertinent to positive values of the mass, there are other states
for which the energy is related to the momentum by a relation of the following
type: $W = \pm\sqrt{c^2p^2 - k^2c^4}$; such states exist for all positive values of k but only
for $p \geq kc$, and can be regarded as pertaining to the imaginary value ik of the mass."
Tachyonic solutions thus enter a relativistic wave equation for the first time.
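The two mass sequences quoted above follow from a single rule, a rest mass of m/(j + 1/2) for spin j. The short sketch below tabulates it with exact fractions; the function name `majorana_mass` and the choice of m = 1 as unit are illustrative assumptions, not notation from Majorana's paper.

```python
from fractions import Fraction


def majorana_mass(j, m=Fraction(1)):
    """Rest mass m / (j + 1/2) for spin j, where j is a non-negative
    integer or half-integer; m is the mass parameter of the equation."""
    j = Fraction(j)
    if j < 0 or (2 * j).denominator != 1:
        raise ValueError("j must be a non-negative integer or half-integer")
    return m / (j + Fraction(1, 2))


# Half-integer spins j = 1/2, 3/2, 5/2 (in units of m)
half_integer = [majorana_mass(Fraction(2 * n + 1, 2)) for n in range(3)]
# Integer spins j = 0, 1, 2 (in units of m)
integer = [majorana_mass(n) for n in range(3)]

print([str(x) for x in half_integer])
print([str(x) for x in integer])
```

In units of m the first list is 1, 1/2, 1/3 and the second is 2, 2/3, 2/5, matching the values in the quotation; the mass decreases as the spin grows, so no two states of different spin share the same mass.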

All these peculiarities were rediscovered and appreciated only years later (see
below): the physical and mathematical content of the Majorana theory was, evidently,
too far ahead of its time, in a period when people had become convinced
that spin-1/2 particles were enough for Nature, and that such a result followed
directly from Lorentz invariance through the simple Dirac equation. The situation
did not change much even when, in 1934, Pauli and V.F. Weisskopf [25] succeeded
in "solving" the problem of the negative energy states for the Klein-Gordon equation
(namely, by "second-quantizing" this theory), thus showing that the marriage
between quantum mechanics and special relativity did not necessarily require
spin 1/2 for the correct interpretation of the formalism, as erroneously believed.
Thus, the Klein-Gordon theory was shown to contain nothing intrinsically wrong that
could justify the derivation of the Dirac theory; the two theories simply apply
to particles with different spin, as already deduced by Majorana himself [7].

The mathematical relevance of Majorana's paper should not be underestimated
either. Remarkably, he obtained the simplest infinite-dimensional
unitary representations of the Lorentz group, which were rediscovered by E.P. Wigner
in his 1939 and 1948 works [26]. It is also quite interesting to dwell on the extremely
simple reasoning which led Majorana to consider the infinite-dimensional
representations, easily recognizable in his personal notebooks (see page 446 in Ref. [12]):

The representations of the Lorentz group are, except for the iden- 
tity representation, essentially not unitary, i.e., they cannot be con- 
verted into unitary representations by some transformation. The 
reason for this is that the Lorentz group is an open group. How- 
ever, in contrast to what happens for closed groups, open groups 
may have irreducible representations (even unitary) in infinite di- 
mensions. In what follows, we shall give two classes of such rep- 
resentations for the Lorentz group, each of them composed of a 
continuous infinity of unitary representations. 

Equally interesting is Wigner's acknowledgment of Majorana's work, typical of a
rigorous mathematician:

The representations of the Lorentz group have been investigated
repeatedly. The first investigation is due to Majorana, who in fact
found all representations of the class to be dealt with in the present
work excepting two sets of representations. [...] The difference
between the present paper and that of Majorana [...] lies - apart from
the finding of new representations - mainly in its greater
mathematical rigor. Majorana [...] freely uses the notion of infinitesimal
operators and a set of functions to all members of which every
infinitesimal operator can be applied. This procedure cannot be
mathematically justified at present, and no such assumption will
be used in the present paper. Also the conditions of reducibility
and irreducibility could be, in general, somewhat more complicated
than assumed by Majorana [26].
The greater mathematical rigor by Wigner certainly justifies the 56 pages of his
paper, but the answer-at-a-distance by the physicist Majorana is illuminating: "in 
order to avoid exaggerated complications, we will give the transformation law only 
for infinitesimal Lorentz transformations, since any finite transformation can be 
obtained by integration of the former ones." Such an approach is still used by 
practically any physicist. 

Despite such simplicity of reasoning, this Majorana theory was apparently not
appreciated by many, given its very few traces in the literature:^2 apart from Wigner,
for what concerns us here it was quoted only in [57] [53] [55] in the period
of time considered. Nevertheless, this is only partially true (especially for Pauli,
see below), since many of the ideas underlying it, and even some of the results obtained
by Majorana, were rediscovered and reconsidered later by other people at different
times, as we will see in the following.

2.4. Majorana-Oppenheimer formulation of electrodynamics (1931). After
the success of the first-order Dirac equation in describing a relativistic quantum
particle, a quite peculiar belief gained ground, namely that the "wave" properties of
a (neutral) field should be described by a (second-order) "wave" equation as in the
Klein-Gordon case, while the "particle" properties should instead be described by a
(first-order) "particle" equation as in the Dirac case. While this belief was latent
in the minds of physicists in the 1930s, and the question was properly addressed only
with the clear emergence of quantum field theory, the words of N. Kemmer, written as
late as 1939, are unambiguous:

In the case of particles with an electric charge [...] the transition
to the limiting theory of a classical particle is possible.

On the other hand, for any uncharged particle [...] a limiting
classical particle theory does not exist.

Conversely, one finds that, at least in the Bose case, a complete
correspondence of the theory of uncharged particles with a classical
wave theory exists, whereas the correspondence appears to fail for
charged particles. The latter fact can well be understood, for a
classical entity corresponding to the quantum mechanical "charged
field" is hard to envisage.

For the special case of the meson we thus have the following
peculiar situation: although it seems likely that both charged and
uncharged mesons exist, and the quantum treatment of the two is
well nigh identical, the uncharged one [...] is classically a true field,
the charged one, on the other hand, a particle [57].

The quite obvious conclusion was that such a situation "is neither justified by
experimental considerations nor by arguments of correspondence."
Kemmer's own solution of this issue, initially developed for the particular
case of mesons, will be described below, while we now dwell a bit on the earlier
consideration of the problem.

^2 This is also related to the fact that it was published in a journal that was not
internationally renowned. As already noted by Fradkin [9], "Science Abstracts (Section A,
Physics), the English language abstracting service, did not abstract from Nuovo Cimento
until 1946. Majorana's article was given a several line abstract by the contemporary
German abstract service [Physikalische Berichte, 1933-I, p. 548] but the abstractor,
whose major field was fluorescence of salts and crystal studies, failed to assess its
significance or even mention the occurrence of the infinite-dimensional representations."
The first "particle" for which such an asymmetric description was noted was
the photon. It is described as a "classical" field by the second-order D'Alembert
wave equation deduced from the Maxwell equations, but it was J.R. Oppenheimer
[29] who first considered the problem of finding a first-order "particle" equation for
it: "our present problem is to find the wave equation for the de Broglie waves of
the [light] quantum." The "equation of the retarded potentials"

$$ \Box V = 0, \tag{11} $$

indeed, according to Oppenheimer is "in several respects unsatisfactory. Here, just
as for the electron, we should want a linear equation, in order to obtain a suitable
density flux vector with vanishing divergence." In the much simpler (and independent,
though unpublished) formulation by Majorana [30], the problem is just that of
translating the (first-order) Maxwell equations into a Dirac-like form for a suitable
photon wave function. Oppenheimer showed that such a wave function "no longer
behaves as an invariant under space rotations, but as a spinor of the first rank",
and, in particular, a 3-component "spinor" (that is, a vector) is required. By taking
$\psi = \mathbf{E} - ic\mathbf{B}$, where $\mathbf{E}$, $\mathbf{B}$ are the electric and magnetic fields of the associated
electromagnetic wave, the four Maxwell equations (in vacuum) can indeed be written
in the form of a Dirac equation:

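$$ \left( \frac{W}{c} - \boldsymbol{\alpha}\cdot\mathbf{p} \right)\psi = 0, \tag{12} $$

with the necessary transversality condition $\mathbf{p}\cdot\psi = 0$. The mass of the particle is,
of course, zero. The quantities $\alpha$ are 3 × 3 hermitian matrices given by

$$ \alpha_1 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & i \\ 0 & -i & 0 \end{pmatrix}, \qquad
   \alpha_2 = \begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & 0 \\ i & 0 & 0 \end{pmatrix}, \qquad
   \alpha_3 = \begin{pmatrix} 0 & i & 0 \\ -i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \tag{13} $$

which satisfy the relations

$$ [\alpha_i, \alpha_j] = -i\,\epsilon_{ijk}\,\alpha_k. \tag{14} $$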
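The properties of the three 3 × 3 matrices can be verified directly. The sketch below is an illustration under the assumption that the matrices have entries $(\alpha_i)_{jk} = i\,\epsilon_{ijk}$, the form consistent with hermiticity and with the commutation relations stated above; all function names are hypothetical. It also shows that $\boldsymbol{\alpha}\cdot\mathbf{p}$ acts on a 3-component $\psi$ as $-i\,\mathbf{p}\times\psi$, which is how a Dirac-like equation can encode the curl in the Maxwell equations.

```python
def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices taken from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


# Assumed form: (alpha_i)_{jk} = i * epsilon_{ijk}
ALPHA = [[[1j * levi_civita(i, j, k) for k in range(3)] for j in range(3)]
         for i in range(3)]


def matmul(x, y):
    """Product of two 3x3 matrices."""
    return [[sum(x[r][s] * y[s][c] for s in range(3)) for c in range(3)]
            for r in range(3)]


def is_hermitian(x):
    """True if x equals its conjugate transpose."""
    return all(x[r][c] == x[c][r].conjugate() for r in range(3) for c in range(3))


def satisfies_commutation_relations(tol=1e-12):
    """Check [alpha_i, alpha_j] = -i epsilon_{ijk} alpha_k for all i, j."""
    for i in range(3):
        for j in range(3):
            ab, ba = matmul(ALPHA[i], ALPHA[j]), matmul(ALPHA[j], ALPHA[i])
            for r in range(3):
                for c in range(3):
                    rhs = sum(-1j * levi_civita(i, j, k) * ALPHA[k][r][c]
                              for k in range(3))
                    if abs(ab[r][c] - ba[r][c] - rhs) > tol:
                        return False
    return True


def alpha_dot(p, psi):
    """Apply (alpha . p) to a 3-component psi, for a numerical 3-vector p."""
    m = [[sum(p[i] * ALPHA[i][r][c] for i in range(3)) for c in range(3)]
         for r in range(3)]
    return [sum(m[r][c] * psi[c] for c in range(3)) for r in range(3)]


def cross(a, b):
    """Ordinary vector product a x b."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


print(all(is_hermitian(a) for a in ALPHA), satisfies_commutation_relations())
print(alpha_dot([1, 2, 3], [4, 5, 6]), [-1j * v for v in cross([1, 2, 3], [4, 5, 6])])
```

The sample vectors in the last line are arbitrary test values; the two printed lists agree component by component.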

The probabilistic interpretation of such a theory is indeed possible, given the
continuity equation satisfied by the wave function $\psi$, that equation being just the
translation into the present language of the Poynting theorem for the electromagnetic
field (see the last reference in [30]). However, as recognized by Oppenheimer
himself, this theory of the photon is unsatisfactory for several reasons, the
main one being the lack of explicit Lorentz invariance, given the use of
a 3-vector instead of a covariant 4-vector. Nevertheless, the novel basic concepts
introduced proved long-lasting:

(1) a directly observable wave function (related to the electric and the magnetic
fields, rather than to the electromagnetic potentials) is worthy of consideration;

(2) the spinor space is not necessarily that introduced by Dirac, a different
dimension and algebra (as given in (14)) being possible.

The first point, though notably interesting, is beyond the scope of the present
paper and will not be discussed further. Instead, some effort has been devoted
since 1931 to the second point, as already introduced in the above discussion of
the Majorana infinite-component equation, and it will be the subject of the following
paragraphs.
2.5. de Broglie theory of a "composite" photon (1934). The starting point
of the Majorana-Oppenheimer theory was mainly theoretical in nature, being
inspired by the desire of "translating" the Maxwell equations into a Dirac-like form.
This point was further considered in 1934 (without reference to Oppenheimer) by
de Broglie, who started from a fully physical motivation:

We had proposed to consider the photon as formed by two
complementary corpuscles which are to each other
as the positive electron is to the negative one, and we had
attached to this conception a definition of the electromagnetic fields
associated with the photon [31].

Here, the original idea behind the theory is evidently related to the known process
of electron-positron annihilation into photons, which could easily lead to the
(erroneous) belief that the photon was effectively composed of two such particles,^3
with the implicit prediction of a massive photon. In order to carry out his proposal,
de Broglie needed to introduce "a wave equation for the photon, instead of
starting, as we had done, from the Dirac equation for the semi-photon."
The de Broglie equation for the photon had just the same form as a Dirac
equation,

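$$ \frac{1}{c}\frac{\partial\psi}{\partial t} = \frac{\partial}{\partial x}A_1\psi + \frac{\partial}{\partial y}A_2\psi + \frac{\partial}{\partial z}A_3\psi + \frac{i}{\hbar}\,\mu_0 c\,A_4\psi, \tag{15} $$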

where $\mu_0$ is the mass of the "semi-photon". Here the interesting novelty is the
introduction of the four 16 × 16 matrices obtained as products of different
Dirac matrix spaces (explicitly reported in [31]), in accordance with the original
idea envisaged above. Indeed, as de Broglie himself realized, such matrices do not
follow the Dirac algebra (5), though the component $\alpha$ matrices do. As we will
see below, it was a student of de Broglie, G. Petiau, who worked again on these
16 × 16 matrices and became the first to discover the so-called DKP algebra [33].
For the moment, however, we only mention an interesting property of one possible
solution of the de Broglie equation, namely the solution which "corresponds to
vanishing energy, momentum and spin. It is therefore suited to
representing the annihilation state of the photon."

De Broglie maintained his hypothesis (the so-called "neutrino theory of the photon")
for more than 40 years [8], despite the experimental evidence against it (its
basic prediction is a non-vanishing mass for the photon which, however, has not been
observed).

2.6. Dirac-Fierz-Pauli generalized equations for spin higher than 1/2 
(1936-9). The appearance of the Dirac equation gave physicists, for the first time,
an example of a system of equations which is invariant in form when subjected
to a Lorentz transformation, but which is not written in terms of tensors. Indeed,
though not supported by mathematicians, physicists tacitly assumed that the
ordinary tensor language comprised all possible representations of the Lorentz group,
while Eq. (7) explicitly showed that this is not true. Spinor calculus was worked out
basically by B. van der Waerden [34], who also showed how to write the Dirac equation
in an automatically covariant form. Though particularly difficult for the physicists
of the time uninitiated to this mathematical machinery, such a powerful instrument



Note that the successful Fermi theory of nuclear beta decay [32], where a similar
problem was correctly addressed, was contemporary with the paper by de Broglie.






was completely adopted by Dirac (and a few others), who succeeded in writing down a
generalization of his earlier equation to particles with integral or half-odd integral
spin greater than 1/2. It connects two irreducible spinors; for a particle of spin
n + 1/2 and mass m, it was written in spinor notation as follows:

. 4d/3i/32.../3„ _ n/3i/32.../j„ 

(16) 

The component wave functions A and B are each completely symmetric in their
dotted and undotted spinor indices, and p is the momentum operator written
as a covariant spinor. For n = 0, Eqs. (16) are equivalent to the Dirac spin-1/2
equation and the two spinors A, B transform like a Dirac bi-spinor. The basic
requirement considered by Dirac that led to Eqs. (16) is that each component of
the wave function satisfies the Klein-Gordon equation (3), just as in the Dirac
spin-1/2 equation. Indeed, either A or B can be eliminated from the equations,
leaving a second-order equation satisfied by each component separately,
provided that the following condition is applied:

plB^t.-X=0 (18) 

(and similarly for A). Actually, such a condition effectively holds, since it can be
deduced, for instance, by contracting the indices α and β_1 in the second of Eqs.
(16), so that Dirac's original aim was fulfilled.

Apart from the finite number of components considered in the Dirac theory,
the requirement that Eq. (3) holds for any component of the wave function is
a remarkable difference with respect to the Majorana theory of 1932, since it is
physically equivalent to requiring that the particle described by the wave function
have (in each case) only one value of the mass (except for the sign), contrary
to what happened in the Majorana theory. The same is true for the spin: each
equation describes a particle with only one spin state.

The problems with the negative energy states were solved by M. Fierz in 1939
[35] in just the same way as for the Dirac spin-1/2 equation and for the
Klein-Gordon equation, that is by setting up a scheme of second quantization for the
theory (in the absence of an external field). This led Pauli to consider (again
with Fierz) the obvious subsequent case of second quantization when an external
electromagnetic field is instead applied [36], but here a surprising fact came out even
without considering field quantization at all. In fact, Fierz and Pauli discovered that
"the most immediate method of taking into account the effect of the electromagnetic


A spinor with one dotted and one undotted index transforms like a 4-vector [34] so that,
by pairing β_i and ε_i, one could replace n dotted and n undotted spinor indices of
both A and B by n symmetric traceless 4-vector indices. Therefore, Eqs. (16) could be
written in the same formalism as in Eq. (4), with a wave function having, beside the
bi-spinor index, n symmetric traceless 4-vector indices:

(^^S'''' + afp, + atmc^i^t^,^,„,^=0. (17) 

However, since the powerful mathematical formalism offered by spinor calculus, though
somewhat more involved, plays by no means a secondary role in the theory originally
proposed (as well as in subsequent reconsiderations), we prefer to draw the reader's
attention to Eqs. (16) rather than to (17).

field, proposed by Dirac (1936), leads to inconsistent equations as soon as the spin
is greater than 1."

The interaction with an electromagnetic field was usually introduced by replacing
the rules in (2) with the standard ones in the minimal coupling scheme:

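$$\frac{W}{c} \rightarrow \frac{i\hbar}{c}\frac{\partial}{\partial t} + e\varphi, \qquad \mathbf{p} \rightarrow -i\hbar\nabla + e\mathbf{A}, \qquad (19)$$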

where (φ, A) are the electromagnetic potentials. The problem pointed out by Fierz
and Pauli derived from the subsidiary condition in Eq. (18). Indeed, while the
equations in (16) can be deduced from a variational principle (that is, a lagrangian
function may be written leading to those equations, as in the Dirac spin-1/2 case, in
the Klein-Gordon case or even in the Majorana case), from what was pointed out above
it is clear that this does not happen for the subsidiary condition (18), since it is
derived directly from the equations of motion (16). This was not a really relevant
point for the field-free case, but the contrary is true when the replacements (19)
are applied directly to Eq. (18) (and not in the lagrangian function), since the
spinor fields should then satisfy additional constraints that "cannot in general be
satisfied simultaneously with the other equations." Fierz and Pauli solved the problem
by deducing a lagrangian function from which both Eqs. (16) and (18) can be obtained
by a rather complicated procedure. "This consists in introducing auxiliary tensors
or spinors of lower rank than the original ones [...] and deriving all equations
from a variation principle without having to introduce extra conditions" [36]. The
arbitrary constants present in the lagrangian function were adjusted in such a way
that, in the free case, the equations of motion led to the identical vanishing of the
auxiliary tensors/spinors, while Eqs. (18) and (16) held true, and the interaction
with an electromagnetic field could then be introduced by the standard procedure
in (19). The "artifice" employed by Fierz and Pauli, however, was not very elegant
since, in the presence of interaction, the auxiliary spinors did not vanish identically,
though there were just so many equations that, for any given value of the momentum of
the particle, there was the right number of independent states corresponding to
the spin of the particle at hand. Thus, the procedure worked fine, but became
progressively more complicated for larger spins.

2.7. Proca equation for massive spin-1 particles (1936). De Broglie's
idea of a composite photon influenced physicists for some time in the 1930s,
especially those who worked in France. This was the case of A. Proca, a Romanian
theoretician whose doctoral advisor was de Broglie himself. In 1936 [37] he
interpreted the two (complex conjugate) terms in the plane wave expansion of the
photon field in quantum electrodynamics as the "superposition" of two elementary
particles (as happened for the quantized Dirac electron-positron field), following
de Broglie's original idea:

ces particules ont même énergie (positive), même quantité de mouvement
et même spin, mais des charges, des courants et des moments
électromagnétiques opposés. L'important est le fait qu'elles ne sont
plus des neutrinos, mais des particules chargées, de masse nulle; ce
sont en quelque sorte des charges pures [37].

Thus, according to Proca [38], "la théorie de la lumière entre par cette voie dans
le cadre d'une mécanique quantique générale qui englobe notamment les électrons
négatifs et les positons." The basic concept he introduced was that of "pure charge"
particles with zero rest mass (roughly speaking, massless electrons and positrons),
which were assumed to be the two components of the photon. This allowed Proca
to describe the photon by means of a 4-component wave function $\psi_r$ (where,
however, r is a Lorentz index), each component of which was treated as an independent
scalar wave function. Here, although Proca always took the limit of zero mass for
the application to the photon, he nevertheless considered the general case of a
massive spin-1 particle. In this case, each component had to satisfy the Klein-Gordon
equation (for a free particle):

$$\Box\,\psi_r = k^2\,\psi_r \qquad (20)$$

($k = mc/\hbar$). However, Proca showed that, in order to have a positive energy
(and, more generally, a positive definite energy density at every space point), the
supplementary condition (and its complex conjugate)

$$\partial^r \psi_r = 0 \qquad (21)$$

must hold.

A meaningful reformulation in terms of first-order rather than second-order
differential equations (in analogy to the "translation" of the Klein-Gordon equation
into the Dirac equation) was also given. Indeed, inspired by Maxwell
electrodynamics, a skew-symmetric tensor

$$G_{rs} = \partial_r \psi_s - \partial_s \psi_r \qquad (22)$$

(and its conjugate) was introduced, whose equations of motion are just
generalizations of the Maxwell equations to non-zero photon mass:

$$\partial^r G_{rs} = k^2\,\psi_s. \qquad (23)$$

From these, it immediately followed that the equations satisfied by the wave
components $\psi_r$ read

$$\Box\,\psi_r - \partial_r\left(\partial^s \psi_s\right) = k^2\,\psi_r, \qquad (24)$$

but from (23) it followed as well, by simply using the skew symmetry of $G_{rs}$, that
the condition (21) holds, so that Eq. (20) was recovered.
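
The step from the first-order equations to the supplementary condition can be spelled out in a few lines. The following is only a sketch, assuming $k \neq 0$, commuting partial derivatives, and the index placement used for the skew-symmetric tensor and its equation of motion above:

```latex
% Sketch: the divergence of the Proca equation yields the supplementary condition.
\begin{align*}
  G_{rs} &= \partial_r \psi_s - \partial_s \psi_r ,
  \qquad \partial^r G_{rs} = k^2 \psi_s , \\
  \partial^s \partial^r G_{rs} &= 0
  \quad \text{(symmetric } \partial^s \partial^r \text{ contracted with skew } G_{rs}\text{)} , \\
  \Rightarrow \quad k^2\, \partial^s \psi_s &= 0
  \quad \Rightarrow \quad \partial^s \psi_s = 0 \quad (k \neq 0) , \\
  \partial^r G_{rs} &= \Box \psi_s - \partial_s \left( \partial^r \psi_r \right)
  = \Box \psi_s = k^2 \psi_s .
\end{align*}
```

For $k = 0$ the second line gives no constraint, which is the familiar gauge freedom of the massless case.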

The Proca equations (23) (or the equivalent forms (20), (21)) can be obtained
from a variational principle with an appropriate lagrangian function that is a
thorough generalization of the Maxwell electrodynamic lagrangian, so that the
introduction of the interaction with an external field is straightforward by means of
the usual minimal coupling principle. Here, however, the very interesting point, as
clearly pointed out by Pauli a few years later, was that gauge arbitrariness for the
field $\psi_r$ no longer exists, contrary to the case of the massless photon described
by the 4-potential $A_\mu$: $\psi_r$ "is uniquely defined by a given [$G_{rs}$], just as
[$G_{rs}$] is defined by [$\psi_r$] from [(22)]. As a consequence, for non-vanishing rest
mass, the addition of a gradient to [$\psi_r$] is not permitted. Hence no gauge
transformations of the second kind exist for the [$\psi_r$]" when $m \neq 0$ [39].

2.8. Majorana "symmetric" equation for spin-1/2 fermions (1937). In
1937 Majorana published a paper [40] containing a theory already elaborated some
years earlier [13]. Here the main aim was not that of writing down a novel equation,
but rather that of reformulating the existing Dirac equation (4) in order to achieve a
complete "symmetry" between the electron and positron components described by
it. Already in 1933 Heisenberg noted the substantial symmetry, in the Dirac theory,
between processes involving electrons and those involving positrons [41], and even
some time earlier Heisenberg himself had elaborated an interesting application in which
he considered the symmetry between holes and electrons in an occupied atomic
level or in an occupied energy band of a crystal [42]. However, such a general idea
of a particle-antiparticle symmetry was formally developed into a consistent theory
only by Majorana in his most famous article on a symmetric theory of electrons and
positrons [40]. The equation considered by him is just the Dirac equation written
in the form:

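$$\left[\frac{1}{c}\frac{\partial}{\partial t} - \boldsymbol{\alpha}\cdot\nabla - i\beta\,\frac{mc}{\hbar}\right]\psi = 0, \qquad (25)$$

but here Majorana introduced a choice for the four independent α, β matrices
different from the standard Dirac one, namely

$$\alpha_x = \sigma_x \otimes \sigma_x, \quad \alpha_y = \sigma_z \otimes 1, \quad \alpha_z = \sigma_x \otimes \sigma_z, \quad \beta = -\sigma_x \otimes \sigma_y, \qquad (26)$$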

where $\boldsymbol{\sigma} = (\sigma_x, \sigma_y, \sigma_z)$ are the Pauli matrices. Such a
choice had profound implications for the theory, since $\alpha_x$, $\alpha_y$, $\alpha_z$
and $-i\beta$ all have only real elements, so that Eq. (25) is an equation with
real-valued coefficients. The bispinor field $\psi$ can, of course, again be decomposed
into a real and an imaginary part, $\psi = U + iV$, as in the Dirac theory, but now the
separate equations for U and V are completely identical. Thus, in the quantum
description of charged particles, the present theory was completely symmetric with
respect to particles and antiparticles. Majorana was aware of the fact that such an
advantage was purely formal, since there was no distinction between the two theories
in physical applications (but with the important result that the cancellation of
infinite constants, relative to single field modes, is required by the symmetrization
of the theory). However, Eqs. (25), (26) had in addition a different solution not
present in the Dirac theory, that is a real solution $\psi = U$, without the
introduction of the V field. In this case, the theory described a chargeless particle
or, rather, in Majorana's own words, Eqs. (25), (26) constituted "the simplest
theoretical representation of neutral particles", without the need of antiparticles.
Majorana then also provided the necessary formal developments aimed at giving a solid
field-theoretic basis to Eqs. (25), (26), which were derived from a variational
principle by means of an appropriate lagrangian function (containing only the U
field). As shown earlier by Dirac and Pauli-Weisskopf, this was not a trivial task,
but Majorana, guided just by mathematical elegance and symmetry, succeeded in making
the idea that spin-1/2 particles could be their own antiparticles theoretically
respectable, that is, consistent with the general principles of relativity and
quantum theory, as already known for photons. This was acknowledged by several people
just after the appearance of Majorana's paper, including Pauli, who appreciated both
"the procedure of Majorana" [36] (that is, the field-theoretic derivation) and the
"decomposition with respect to charge conjugate functions" [39]. However, as in the
case of other writings of his, the "Majorana neutrino" theory too started to gain
prominence only decades later, beginning in the 1950s.
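
The two properties that make Majorana's choice work (the matrices obey the Dirac algebra, and α_x, α_y, α_z and -iβ are all real) can be checked numerically. The sketch below is illustrative only: it assumes the reading α_x = σ_x ⊗ σ_x, α_y = σ_z ⊗ 1, α_z = σ_x ⊗ σ_z, β = -σ_x ⊗ σ_y of Eq. (26) with the usual Kronecker-product ordering, and the helper names (`kron`, `dirac_algebra_holds`, `coefficients_are_real`) are hypothetical, not taken from the original paper.

```python
# Pauli matrices as nested lists (complex entries where needed).
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def is_zero(a, tol=1e-12):
    return all(abs(x) < tol for row in a for x in row)


def is_real(a, tol=1e-12):
    return all(abs(complex(x).imag) < tol for row in a for x in row)


# Assumed reading of Majorana's choice, Eq. (26).
ALPHA_X = kron(SX, SX)
ALPHA_Y = kron(SZ, I2)
ALPHA_Z = kron(SX, SZ)
BETA = scale(-1, kron(SX, SY))
MATS = [ALPHA_X, ALPHA_Y, ALPHA_Z, BETA]
I4 = kron(I2, I2)


def dirac_algebra_holds(mats):
    """Check the anticommutators {M_a, M_b} = 2 delta_ab for all pairs."""
    for a, ma in enumerate(mats):
        for b, mb in enumerate(mats):
            anti = add(matmul(ma, mb), matmul(mb, ma))
            target = scale(2 if a == b else 0, I4)
            if not is_zero(add(anti, scale(-1, target))):
                return False
    return True


def coefficients_are_real():
    """alpha_x, alpha_y, alpha_z and -i*beta must have only real entries."""
    alphas_real = all(is_real(m) for m in (ALPHA_X, ALPHA_Y, ALPHA_Z))
    return alphas_real and is_real(scale(-1j, BETA))


if __name__ == "__main__":
    print("Dirac algebra satisfied:", dirac_algebra_holds(MATS))
    print("Real coefficients:", coefficients_are_real())
```

Since β itself is purely imaginary in this representation, it is the combination -iβ that enters the wave equation with real coefficients, which is what allows a purely real solution.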

2.9. Kemmer equation and the DKP algebra (1939). The successful effort 
to explain at least some of the features of nuclear forces made by H. Yukawa in
1935 [44] allowed physicists to deal seriously with novel particles different from the
known spin-1/2 ones (proton, neutron, electron and positron). Indeed, Yukawa
showed that the short-ranged interaction between two nucleons was mediated by a


Other people included Kemmer [27], Wigner [26] and F.J. Belinfante [43].

boson particle with mass intermediate between those of the proton and the electron,
which could be termed "meson". In 1938 it was definitively proved that a particle
of this type was present in cosmic rays [46], but it took about ten years [47] to
recognize that the meson discovered in 1937-8 was different from that hypothesized
by Yukawa. Nevertheless, both the success of the Yukawa theory and the observation
of mesons in cosmic rays gave quite a strong impetus to the search for and study
of possible equations describing mesons. At that time, no firm experimental
information existed about the spin of either charged or neutral mesons, and as far as
Yukawa's theory was concerned it could be either 0 or 1. Now, while equations for
both spin-0 (Klein-Gordon) and spin-1 (Proca) already existed, as well as general
equations in generalized spinor notation (Dirac-Fierz-Pauli), in 1939 Kemmer
developed a theory based on a "novel" equation, whose good fortune lasted for several
years. The reason is that it was becoming clear [48] that the low-energy nuclear
interactions were due to pseudoscalar (spin 0) and vector (spin 1) mediators, and
the Kemmer equation described them both.

The definitive formulation of the Kemmer equation has behind it an interesting
historical development (reconstructed in Ref. [8]), which involved several people
working on different subjects. The story starts with the already mentioned Petiau
who, in 1936 [33], looked at a modification of the algebra of the matrices appearing
in the de Broglie equation (15), thus discovering the algebra satisfied by the four
16 × 16 matrices introduced:

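$$\beta_\mu \beta_\nu \beta_\lambda + \beta_\lambda \beta_\nu \beta_\mu = \beta_\mu \delta_{\nu\lambda} + \beta_\lambda \delta_{\nu\mu}. \qquad (27)$$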

However, Petiau's work remained practically unknown to almost everyone for many
years, and the same happened to a paper by J. Geheniau [49] who, two years
later, decomposed such an algebra into the ten-, five- and (trivial) one-dimensional
representations. Meanwhile, Kemmer studied the second-order Proca equations,
and found [50] that they could be described as a set of coupled first-order equations,
which he wrote down along with the corresponding equations for the spin-0 case.
He then realized that such equations could be written in 10 × 10 (for spin 1) and
5 × 5 (for spin 0) matrix forms, though he did not recognize the algebra satisfied
by them. Some time later, the mathematician R.J. Duffin ran into the paper by
Kemmer (during a seminar), and rewrote [51] both the spin-0 and the spin-1
equations into a first-order matrix formulation, with the β matrices appearing there
satisfying the algebra in (27) (though he did not consider one of the four constituent
commutation relations in (27), namely that with λ = μ ≠ ν). "When Kemmer saw
the note that Duffin published on his results, he wrote to Duffin saying he knew how
to extend the theory but would first wait for Duffin to publish anything further.
However, Duffin was by then involved in a collaboration with A.C. Schaeffer on
function theory, and so wrote to Kemmer to go ahead. Kemmer then quickly put
together all he had been doing, and produced his classic 1939 paper" [8].
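
The trilinear algebra found by Petiau can be checked numerically on 16 × 16 matrices built from two copies of the Dirac matrices. The sketch below is only an illustration under stated assumptions: it takes β_μ = (γ_μ ⊗ 1 + 1 ⊗ γ_μ)/2 with hermitian Euclidean γ matrices satisfying {γ_μ, γ_ν} = 2δ_μν (a standard construction, not necessarily the explicit matrices of de Broglie or Petiau), and all function names are hypothetical.

```python
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def is_zero(a, tol=1e-12):
    return all(abs(x) < tol for row in a for x in row)


# Assumed Euclidean Dirac matrices: gamma_k = sigma_y (x) sigma_k, gamma_4 = sigma_z (x) 1.
GAMMA = [kron(SY, SX), kron(SY, SY), kron(SY, SZ), kron(SZ, I2)]
I4 = kron(I2, I2)
I16 = kron(I4, I4)

# Assumed 16 x 16 matrices: symmetrized sum over the two Dirac spaces.
BETA = [scale(0.5, add(kron(g, I4), kron(I4, g))) for g in GAMMA]


def delta(a, b):
    return 1 if a == b else 0


def dkp_algebra_holds(beta):
    """Check b_m b_v b_l + b_l b_v b_m = b_m delta_vl + b_l delta_vm."""
    idx = range(len(beta))
    pair = {(m, v): matmul(beta[m], beta[v]) for m in idx for v in idx}
    for m in idx:
        for v in idx:
            for l in idx:
                lhs = add(matmul(pair[(m, v)], beta[l]),
                          matmul(pair[(l, v)], beta[m]))
                rhs = add(scale(delta(v, l), beta[m]),
                          scale(delta(v, m), beta[l]))
                if not is_zero(add(lhs, scale(-1, rhs))):
                    return False
    return True


def squares_to_identity(beta):
    """A Dirac-type algebra would require every beta_mu^2 to be the identity."""
    return all(is_zero(add(matmul(b, b), scale(-1, I16))) for b in beta)


if __name__ == "__main__":
    print("Trilinear (DKP) algebra satisfied:", dkp_algebra_holds(BETA))
    print("Dirac-type squares:", squares_to_identity(BETA))
```

The second check illustrates the remark that such 16 × 16 matrices do not follow the Dirac algebra even though the constituent 4 × 4 matrices do.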

The main purpose of Kemmer was that of reformulating the Proca equations
(24) in order to obtain first-order wave equations without using the tensor form in
(23), the theory being suitable to "be developed in strikingly close correspondence

As amusingly noted by R. Peierls in 1939 [45], "none of the properties of the new
particle seems to give rise to more controversy than its name. After the names
U-particle, x-particle, heavy electron, yukon, barytron, dynatron, mesotron, mesoton
and meson, and, for the neutral particle, neutretto, had been used by different
authors, the choice seems now to lie between mesotron and meson, of which I shall
adopt the latter."

to Dirac's electron theory". The equation he wrote down was, then, the following:

$$\partial_\mu \beta_\mu \psi + k\,\psi = 0 \qquad (28)$$

($k = mc/\hbar$), where the only assumption about the four matrices $\beta_\mu$ (following
Dirac's original reasoning) was that they satisfy the algebra in (27). He also gave
the explicit form of these matrices in the three inequivalent (ten-, five- and
one-dimensional) irreducible representations of the related algebra and, probably
unexpectedly, he discovered that Eq. (28) described both spin-1 and spin-0 particles:
the ten-row representation simply leads to the usual theory based on
Proca's (1936) equations, in which the wave function consists of four
components forming a 4-vector and six forming an antisymmetrical
tensor; the five-row one to the Klein-Gordon or so called "scalar"
theory, in which the wave function consists of a scalar and its
4-gradient [27].
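
The statement about the five-row representation can be made explicit with a short worked example. This is only a sketch: it assumes one convenient five-dimensional choice, $\beta_\mu = E_{0\mu} + E_{\mu 0}$ (with $E_{ab}$ the matrix having a single unit entry in row $a$, column $b$, and rows labelled $0, 1, \ldots, 4$), which satisfies the trilinear algebra in a Euclidean metric but is not necessarily the explicit form given by Kemmer.

```latex
% Sketch: five-component first-order equation reduced to the Klein-Gordon equation.
\begin{align*}
  \psi &= (\psi_0, \psi_1, \psi_2, \psi_3, \psi_4)^{T}, \qquad
  (\beta_\mu \psi)_0 = \psi_\mu , \quad (\beta_\mu \psi)_\nu = \delta_{\mu\nu}\, \psi_0 , \\
  \partial_\mu \beta_\mu \psi + k \psi = 0
  \;&\Longleftrightarrow\;
  \begin{cases}
    \partial_\mu \psi_\mu + k\, \psi_0 = 0 , \\
    \partial_\nu \psi_0 + k\, \psi_\nu = 0 ,
  \end{cases} \\
  \psi_\nu = -\frac{1}{k}\, \partial_\nu \psi_0
  \;&\Longrightarrow\;
  -\frac{1}{k}\, \Box \psi_0 + k\, \psi_0 = 0
  \;\Longrightarrow\;
  \Box \psi_0 = k^2 \psi_0 .
\end{align*}
```

The four components $\psi_\nu$ are thus fixed by the gradient of the scalar $\psi_0$, which alone carries the dynamics and obeys the second-order equation.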

The present theory was, then, viewed as a generalization and a unification of
previous theories and, as such, a comparison with the more general Dirac-Fierz-Pauli
formalism discussed above was called for. Kemmer realized that it could be carried
out easily if a hamiltonian formulation was also given, and succeeded in
providing it by obtaining the appropriate hamiltonian function from which Eq. (28)
can be deduced:

$$H = \frac{c\hbar}{i}\,\partial_k\left(\beta_k \beta_4 - \beta_4 \beta_k\right) + mc^2 \beta_4. \qquad (29)$$

The conclusion was that "Dirac's hamiltonian formulation shows clearly that the
differences of his theory compared with the present one are merely due to details of
representation." Nevertheless, Kemmer's formulation proved simpler to handle
than the difficult spinor formalism and, due to the nuclear physics observations
mentioned above, it enjoyed a certain consideration among physicists for some time.
Starting from the early 1950s, when it became clear that the observed particle
situation had become more involved than in the late 1930s, physical interest in
Kemmer's equation diminished: why use, for example, a "complicated" five-component
spinor equation when we can use a one-component equation (albeit second-order) that is
much easier to handle and leads to the same results? The situation changed somewhat
in the 1970s (see [8] and references therein), when the problem of the equivalence
between first-order and second-order equations was considered again, and it was
finally realized that the equivalence holds true only for free particles, and fails
for interacting particles.

2.10. Further developments in 1939-1942. After several proposals of relativis- 
tic wave equations, it was time to reason on what had been achieved and to clarify 
some issues. 

In this respect, a first important mathematical paper by Wigner (already
mentioned) appeared in 1939 [26], dealing with unitary representations of the
inhomogeneous Lorentz group. If the wave function describing the possible states of a
free particle satisfies a relativistic wave equation, then a correspondence exists
between the wave functions describing the same state in different Lorentz frames.
These transformations form the group of all inhomogeneous Lorentz transformations, and
a classification of all unitary representations of the Lorentz group amounts to a
classification of all possible relativistic wave equations. Wigner then gave a
classification of such unitary irreducible representations, together with a
prescription for their explicit construction, a remarkable result of his analysis
being that every irreducible wave equation is equivalent to a system of differential
equations (see, however, also the discussion above about Majorana's work of 1932 [15]).

On a more physical side, noteworthy is the general talk given by Pauli at the Solvay
Congress of 1939 (and published in 1941) [32], where the theoretical physicist gave
a lucid review (with further insights) of relativistic theories for spin-0, spin-1/2
and spin-1 particles, including the "special synthesis" of the Kemmer equation. The
approach is by then field-theory oriented, with a variational principle based on
the lagrangian formalism as starting point. The report ended with an analysis of
some physical applications of the theories discussed, dealing with the interaction of
particles with spin 0, 1/2 and 1 with the electromagnetic field. Special mention
should be made of the harsh criticism by Pauli, already timidly expressed by Kemmer
[27], of the de Broglie theory of composite photons: "on the basis of the
interpretation of this paper, however, the de Broglie theory does not describe photons
at all, but rather is a unified description of two particles with equal non-vanishing
rest-mass, with spin values 0 and 1."

A remarkable extension of the Dirac equation to higher spin particles, in the spirit
(but with a different formalism) of the Kemmer equation, was that of F.J. Belinfante
in terms of "undors" [52]. Belinfante assumed particles with spin N/2 to be described
not by spinors but by Dirac wavefunctions $\psi(x; \zeta_1, \ldots, \zeta_N)$ depending
on N 4-component variables $\zeta_r$, on which the Dirac matrices $\gamma_\mu^{(r)}$ act.
The wavefunction $\psi$ is assumed to be symmetric in all the $\zeta_r$ and to satisfy
Eq. (28) with

$$\beta_\mu = \frac{1}{2}\sum_{r=1}^{N} \gamma_\mu^{(r)}. \qquad (30)$$

Such wavefunctions, products of Dirac wavefunctions (so as to include spatial
reflections), were called undors by Belinfante, who considered them as generalizations
"of Dirac wavefunctions in the same sense as tensors form a generalization of vectors."
From the symmetry of $\psi$, however, it turned out that the Belinfante equation
satisfied by undors is equivalent to N identical Dirac equations with a rescaled mass
(by a factor 2/N), one for each set of $\gamma_\mu^{(r)}$. The work by Belinfante was,
then, a different (with respect to the Dirac-Fierz-Pauli theory) generalization of
Dirac's original theory without recourse to spinor calculus, but with the same
problems envisaged above regarding the introduction of auxiliary conditions when the
interaction with external electromagnetic fields is included.

The original Dirac-Fierz-Pauli spinor formalism was, instead, considered in 1940
by G. Gentile [24], who obtained a general expression for an invariant operator
from which any relativistic equation for arbitrary spin, in the Dirac spinor form,
could be deduced (in the Dirac-Fierz-Pauli theory, the corresponding equation was
directly written down heuristically). In particular, he applied the result obtained to
get the relativistic equation for the by then fashionable spin-1 meson:

^'"^ . i'f' 31) 



Fierz [35] considered such a case only implicitly, while Fierz and Pauli [36]
considered only the s = 2, 3/2 cases.

with the auxiliary condition (allowing B to have only three, rather than four,
independent components, as for the symmetric A):

p^^'Bl-p^t'B'^^O. (32) 

The problem of subsidiary conditions was further investigated by the Japanese
physicists S. Sakata and M. Taketani in 1940 [53], who produced a different (and
physically meaningful) formulation of the Kemmer equation. By use of what is
known as a Peirce decomposition [53], they were able to separate out the 2(2s + 1)
(for particle and antiparticle) components of the Kemmer equation into one
distinct hamiltonian equation. The remaining components (essentially the built-in
subsidiary conditions) were in a distinct equation that had to be satisfied
simultaneously in order to obtain a covariant description. According to the
Sakata-Taketani decomposition method, then, the five- and ten-component equations for
spin-0 and spin-1 particles reduced to two- and six-component equations, respectively
("particle components").

Further consideration of the Kemmer equation, as a means to obtain a general
equation for arbitrary spin, led J.K. Lubanski in 1942 [54] to propose the SO(5)
algebra for a first-order wave equation, inspired by previous collaborations with
Belinfante:

$$\left(\Gamma_\mu \partial_\mu + N k\right)\psi = 0. \qquad (33)$$

The matrices $\Gamma_\mu$ were again built from the Dirac γ matrices, and Lubanski
showed that the ten matrices given by $(i/2)\Gamma_\mu$, $(1/4)[\Gamma_\mu, \Gamma_\nu]$
represented the operators for the infinitesimal rotations in a five-dimensional space.
The quantity N in (33) was an integer determining the number of components of the
wavefunction $\psi$ (having $4^N$ components), written as products of components of
Dirac wavefunctions. For N = 1, Eq. (33) reduced to the ordinary Dirac equation for the
electron, while for N = 2 the Kemmer equation was recovered in the form given by
Belinfante. Lubanski proved that for N ≥ 2 the representations for $\Gamma_\mu$ became
reducible: "on peut dire alors que l'Eq. [(33)] ne décrit plus une seule particule mais
une superposition de particules", as for the Kemmer equation. However, "dans le
cas de N ≥ 3 la situation est encore plus compliquée. Les particules décrites par
l'Eq. [(33)] ont non seulement différents nombres quantiques de spin mais aussi
différentes masses." Lubanski, indeed, also recognized the type of spin and mass
spectrum which would be obtained.
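
The claim that ten such matrices generate rotations in five dimensions can be checked directly in the simplest case N = 1. The sketch below is illustrative and rests on assumptions: it takes hermitian Euclidean Dirac matrices with {γ_μ, γ_ν} = 2δ_μν in place of Γ_μ, labels the generators as J_{μ5} = (i/2)γ_μ and J_{μν} = (1/4)[γ_μ, γ_ν], and tests the so(5) commutation relations in one common sign convention; the function names are hypothetical.

```python
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def kron(a, b):
    """Kronecker (tensor) product of two square matrices."""
    n, m = len(a), len(b)
    return [[a[i // m][j // m] * b[i % m][j % m] for j in range(n * m)]
            for i in range(n * m)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return add(matmul(a, b), scale(-1, matmul(b, a)))


def is_zero(a, tol=1e-12):
    return all(abs(x) < tol for row in a for x in row)


# Assumed Euclidean Dirac matrices (the N = 1 case, Gamma_mu = gamma_mu).
GAMMA = [kron(SY, SX), kron(SY, SY), kron(SY, SZ), kron(SZ, I2)]


def build_generators(gamma):
    """Return the antisymmetric array J[a][b], with indices 0..3 for mu and 4 for '5'."""
    zero = [[0] * 4 for _ in range(4)]
    J = [[zero for _ in range(5)] for _ in range(5)]
    for m in range(4):
        t = scale(0.5j, gamma[m])          # (i/2) gamma_mu
        J[m][4] = t
        J[4][m] = scale(-1, t)
        for v in range(4):
            J[m][v] = scale(0.25, comm(gamma[m], gamma[v]))  # (1/4)[gamma_mu, gamma_nu]
    return J


def so5_relations_hold(J):
    """Check [J_ab, J_ce] = d_bc J_ae - d_ac J_be - d_be J_ac + d_ae J_bc."""
    d = lambda a, b: 1 if a == b else 0
    idx = range(5)
    for a in idx:
        for b in idx:
            for c in idx:
                for e in idx:
                    lhs = comm(J[a][b], J[c][e])
                    rhs = add(add(scale(d(b, c), J[a][e]), scale(-d(a, c), J[b][e])),
                              add(scale(-d(b, e), J[a][c]), scale(d(a, e), J[b][c])))
                    if not is_zero(add(lhs, scale(-1, rhs))):
                        return False
    return True


if __name__ == "__main__":
    J = build_generators(GAMMA)
    print("so(5) relations satisfied:", so5_relations_hold(J))
```

The same relations hold for sums of commuting copies of the γ matrices, since commutators of such sums reduce to sums of single-copy commutators; this is why the check for N = 1 already captures the structure.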

2.11. Rarita-Schwinger equation for spin-3/2 particles (1941). In 1941, W.
Rarita and J. Schwinger [55] again considered the problem of "simplifying" the
complicated spinor formalism of the Dirac-Fierz-Pauli theory for half-integral spin,
by treating, in particular, the special case of spin-3/2 particles (already studied
explicitly by Fierz and Pauli [36]). These are described by a spinor field $\psi_\mu$
with an extra vector index, and Rarita and Schwinger succeeded in writing down a
"simple" lagrangian function that "can be constructed without the intervention of
additional fields", as required instead in the Fierz-Pauli case, namely:

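$$L = \bar{\psi}_\mu\left(\gamma_\tau \partial_\tau + k\right)\psi_\mu - \frac{1}{3}\,\bar{\psi}_\mu\left(\gamma_\mu \partial_\nu + \gamma_\nu \partial_\mu\right)\psi_\nu + \frac{1}{3}\,\bar{\psi}_\mu \gamma_\mu\left(\gamma_\tau \partial_\tau - k\right)\gamma_\nu \psi_\nu. \qquad (34)$$

From the Euler-Lagrange equation for such a lagrangian, the following equation
can be deduced (not reported explicitly in the paper):

$$\left(\gamma_\tau \partial_\tau + k\right)\psi_\mu - \frac{1}{3}\left(\gamma_\mu \partial_\nu + \gamma_\nu \partial_\mu\right)\psi_\nu + \frac{1}{3}\,\gamma_\mu\left(\gamma_\tau \partial_\tau - k\right)\gamma_\nu \psi_\nu = 0. \qquad (35)$$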

Of course, as pointed out by the authors in the general case, additional spurious
components are present in the 16-component vector-bispinor wave function $\psi_\mu$, and
subsidiary conditions are required even in the "simpler" formalism.

The (short) paper ended with the consideration of the zero-mass limit: 

In the exceptional case of zero rest mass the wave function admits

a gauge transformation, 

which leaves all physical quantities invariant. 

The method here presented for developing the theory of spin 3/2
thus contains many of the features of both the Proca and the Dirac
theory [55].

Only much later after the appearance of the Rarita-Schwinger theory was it discovered
[56] that a subtle inconsistency is present when the interaction with an external
potential is introduced, namely that the solutions of the equation propagate at
velocities exceeding the speed of light (for arbitrarily weak external fields), thus
violating the postulates of special relativity.

2.12. Bhabha general equation for arbitrary spin (1945). The intricate issue
of a relativistic wave equation for arbitrary spin was reconsidered, after
Dirac-Fierz-Pauli, by H.J. Bhabha in 1944-5 [57], [58] from a general point of view.
The simple form of the basic equation was assumed to be the following:

$$\left(p_k \alpha^k + \chi\right)\psi = 0, \qquad (36)$$

where $p_k = i\,\partial/\partial x^k$ and $\alpha^k$ are four square matrices whose degrees
and commutation rules depend on the spin of the particle considered ($\chi$ is a constant).

The work of Bhabha, however, was not limited to assuming a given form of a
general equation; rather, his lucid analysis aimed at finding the general conditions
to be satisfied when describing a particle of arbitrary spin:

In order to develop equations for higher spin values one must find 
some general principles common to all of them. These are: 

A. It can be deduced from the equations that each component of the
wave function satisfies the second-order wave equation [(21)]. This
is physically equivalent to the statement that the particle described
by the field has in each case only one value of the rest mass (except
for sign).

B. The particle-field is completely described by an equation of the form
[(36)] without the help of any further subsidiary conditions [57].

The main problem with spin higher than 1 was that the corresponding theory 
could not satisfy both properties, and this was very well illustrated by considering 
the known Dirac-Fierz-Pauli theory: 

The DFP equations connect two irreducible spinors, and by a suitable
transformation can be split into two sets, one of which still
connects the two irreducible spinors together, while the other set
only involves one spinor and is in the nature of subsidiary conditions.
We shall see below that the first set can be written in the
form [(36)] but without the second set the first set is not equivalent
to the DFP equations. The second set, consisting of the subsidiary
conditions, is necessary in order that it should be possible to derive
a second-order wave equation for each component [58].

The removal of the inconsistencies related to the subsidiary conditions, as we have seen above, required the introduction of additional subsidiary spinors [36], and this appeared to Bhabha as a loss of "elegance" in the mathematical formulation:

It would, therefore, appear to be more logical to assume that the
fundamental equations of the elementary particles must be first-order
equations of the form [(36)] and that all properties of the
particles must be derivable from these without the use of any further
subsidiary conditions [58].

The price for such a choice was that each component of the wave function did not satisfy the Klein-Gordon equation (3) and, as a consequence, the particle has states of higher rest mass which "are an essential feature of the theory and cannot be eliminated by an artifice any more than the states of negative mass in the usual formulations of the theory." This is evidently reminiscent of Majorana's theory of 1932 [15]. Bhabha then developed a theory where the wave function $\psi$ described particles with different values of the mass, but now "these must all be considered as different states of the same physical entity, just like the positron and electron, since the equations are irreducible" [57]. To make this point even clearer, given the admittedly changed point of view, Bhabha explicitly predicted

that should particles of spin 3/2 or 2 exist in nature, they would 
appear each with two possible values of the rest mass, the lower 
values of the rest mass being the stable ones in each case [...] The 
states of different rest mass being merely different states of the 
same particle, transitions from one mass to another would always 
be possible under the influence of interaction if sufficient energy 
were available for the purpose [58].

The remaining developments of the theory were purely mathematical in nature, mainly regarding the matrices $\alpha^k$ appropriate for a particle that may have spins up to n/2, the author giving a sophisticated analysis based on the fact that $\alpha^k$ must transform as a 4-vector under elements of the Lorentz group. Indeed, it was known that the mathematical objects appearing in any well-founded theory of an elementary particle should be tensors or spinors which are irreducible, that is, which cannot be split into two or more parts in a relativistically invariant way, so that, correspondingly, that theory would be based on an irreducible set of equations. The $\alpha$ matrices should, then, generate the nucleus of the representation which determined the way the wave function $\psi$ transformed under any given transformation of the Lorentz group. Bhabha showed that the problem of finding all the irreducible representations of the $\alpha$ matrices could be connected with the problem of finding the nuclei of all irreducible representations of the Lorentz group in five dimensions, as already pointed out by Lubanski [54], the solutions of which were known.



^^The nucleus is formed by six infinitesimal transformations; the wave function $\psi$ is connected with the representations of the group SO(1,4).

Finally, Bhabha pointed out that, although his theory was a multi-mass and multi-spin one, only one equation applied for each value of n, so that the particle described by it displayed only the spin n/2 in all circumstances, the mass of the particle depending only on the maximum value of the spin.

The general theory by Bhabha received further consideration in the subsequent years (up to the present day), and was also discussed in many textbooks of the 1950s and 1960s [59].

2.13. Further remarks. The relativistic wave equations describing particles with given spin, as discussed above, are the basic ones developed before quantum field theory established itself among physicists as the consistent and complete quantum relativistic theory describing particles. Nevertheless, people continued (until recent times) to consider the issue of relativistic wave equations, but the appearance of the Bhabha equation marked the end of a well-defined phase. Indeed, the different subsequent contributions were (and are) aimed either at further studying the features and the consequences of the theories outlined above, or at generalizing them to novel theoretical frameworks (such as supersymmetry, and so on). We do not consider such contributions here, but only mention, in the following, a few results obtained in the years just after the fundamental papers by Bhabha, which illustrate quite well the directions that later investigations would take.

The first one was contained in a paper by E. Wild [60], where the author showed
how special the cases of spin 1/2 (electrons) and spin 1 (mesons) are in the
Bhabha equation (reducible, in those cases, to the electron Dirac equation and to 
the Kemmer equation), just by studying possible extensions to other types of fields, 
already considered in the general Bhabha equation. Wild deduced that "subject 
to the conditions that (1) the equations must contain no subsidiary conditions, (2) 
either the total energy must be positive definite, or it must be possible to quantize 
the equations according to Fermi statistics without using an indefinite metric in 
Hilbert space, no such extension is possible." It is evident, just from this paper, 
how mathematical overtones were starting to enter the theories considered above. 
Nevertheless, Wild also introduced an interesting physical interpretation regarding 
the Bhabha equation: "the results [obtained by Bhabha] are of considerable interest 
in that some of the equations so obtained describe 'composite' particles which have 
states of different spin and rest mass, but which, in the state of lowest rest mass, 
have spin one-half. It is suggested that the proton and neutron might be described 
by one of these equations." 

Bhabha himself, in a foundational review of 1949 [61] "on the postulational basis of the theory of elementary particles", formulated a generalization of his equation for spin 0 and spin 1, that is, a generalization of the Kemmer equation (28), where the $\alpha$ matrices appearing in Eq. (36) did not satisfy the DKP algebra in (27), but rather the more general one given by:

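$$\sum_{P} \alpha^\mu \alpha^\nu \alpha^\rho = 2 g^{\nu\rho} \alpha^\mu + 2 g^{\rho\mu} \alpha^\nu + 2 g^{\mu\nu} \alpha^\rho, \qquad (37)$$

the sum on the left-hand side running over all permutations of the indices.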

Bhabha realized that the algebra defined by (37) is "not finite and one would expect there to be an infinite number of inequivalent irreducible sets of matrices satisfying [(37)], all except two of which will not satisfy" (27). The corresponding wave equation described a particle of spin 1 having only one value of the mass.

Finally, we end our review with the mathematical paper by V. Bargmann and Wigner of 1948 [62] on "group theoretical discussion of relativistic wave equations." The mathematical derivation of the wave equation for a particle of spin s and mass m went as follows. The wave function $\psi$ of a state comprising N independent, free Dirac particles can be written as a product $\psi = \prod_{i=1}^{N} \psi_i(\zeta_i; p_{ik})$, where $\zeta_i$ labels the four spin functions ($\zeta_i = 1, 2, 3, 4$) and $p_{ik}$ is the 4-momentum of the i-th particle. Each particle would have its own $\gamma$ matrices and mass $m_i$, and the wave equation is just the Dirac equation (7), $\gamma_i^k p_{ik}\,\psi = m_i\,\psi$ (the authors used, as was already usual for theoretical physicists, units such that $\hbar = c = 1$). Bargmann and Wigner then proposed that the general relativistic wave equation for the single particle of spin s and mass m be derived from this last equation by setting all $m_i$ equal to m, all $p_{ik}$ equal to a single 4-vector $p_k$ and requiring the wave function $\psi$ (written above as a product) to be symmetric in the spin labels $\zeta_i$. The wave equation was then:

$$\gamma_i^k\, p_k\, \psi = m\, \psi, \qquad i = 1, \ldots, N \qquad (38)$$

($N = 2s$). Such equations are sometimes called Bargmann-Wigner equations in later developments, and represent the final mathematical systematization of the relativistic wave equation for arbitrary spin, as deduced from general Lorentz group invariance (following the idea of Dirac-Fierz-Pauli).



3. Majorana unknown contributions

We have seen above that Majorana contributed significantly to the issue of relativistic wave equations, with two contributions that were published and, thus, made accessible to physicists. The Majorana-Oppenheimer formulation of photon wave mechanics was, instead, brought to the attention of the scientific community only in recent times [63] and, even more recently, other investigations and results by the Italian physicist on related issues have been discovered [64]. With the publication of Majorana's personal study and research notes [12] [13], a number of outstanding contributions in different areas of physics have come to light, and in the following we will focus on three of them related to the topic considered here. All such contributions were obtained before Majorana's visit to Heisenberg's Institute in Leipzig, in 1933, although more precise dates are not available [13].

It is remarkable how, even in this field (for a general review, see Ref. [14]; see also [65]), Majorana anticipated several later results among those discussed in the previous section, and his own way of reasoning was different from the particular ones envisaged above. Of course, such unknown contributions did not influence the other protagonists of our story, but the opposite is true as well for the results that came later, so that it is interesting to compare the historical path with the completely independent reasoning of Majorana.

Indirect elements of inspiration, already present in the only two papers published on the issue considered [15] [40], can then lead to different though similar perspectives. Indeed, it is intriguing to observe, as an example, the indirect inspiration of Majorana's infinite-component theory [15] on Bhabha's theory of 1945 [58]. Both papers dealt with the problem of a relativistic wave equation for arbitrary spin within a similar formalism (different from the Dirac-Fierz-Pauli one), and both started with first-order equations, in a form similar to the Dirac equation, (4) or (7). Bhabha then followed Majorana in not requiring the Klein-Gordon equation to be satisfied by the components of the wave function and, like Majorana, Bhabha himself obtained a multi-mass equation, where the rest mass depended on the value of the spin of the particle involved. Although the subsequent developments in Bhabha's paper [58] testify to a completely independent derivation, the discussion of general forthcoming results, announced in [57] and again announced and then deduced in [58], suggests nevertheless that those results had already been "metabolized" to some extent, even before their effective (re-)achievement. And, as a matter of fact, Majorana was the first to consider a multi-mass equation for a wave function not satisfying the relativistic energy-momentum relation (1). This is, probably, a beautiful example of latent inspiration, and the following explicit results obtained (and not published) by Majorana may also serve as a guide for a deeper understanding of the issue of relativistic wave equations and its theoretical development.

In the following, we give a sketch of three theoretical contributions reported in the Quaderno n.4 [66], where Majorana elaborated relativistic wave equations for 16-component, 6-component and 5-component spinors, respectively. They are discussed here in the same order as the material appears in the original documents: as we shall see, this is an important point, since it is evident that the author obtained the corresponding results in just this order, and not in the simplistic, more obvious, reverse order of studying particular cases in order to obtain the general results published in [15] and discussed above (see below).



3.1. A 16-component equation for a two-particle system. As a starting point, Majorana apparently studied, like de Broglie in 1934, a system formed by two particles each obeying the Dirac equation but, differently from the de Broglie case, he considered only one particle with a non-vanishing mass m, while the other one was supposed massless. The reason for such a choice is not evident, but it is clear that the physical idea was very different from that of de Broglie.

In the original manuscript, Majorana wrote directly the "Dirac" equation for the 
system in an explicit 16 x 16 matrix form, but from this it is possible to reconstruct 
easily the reasoning behind it as follows. The Dirac equations for the two particles 
were written in the usual form: 



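$$\left[\frac{W}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} + \beta\, m c\right]\psi = 0, \qquad \left[\frac{W'}{c} + \boldsymbol{\alpha}'\cdot\mathbf{p}' + \beta'\, m' c\right]\psi = 0, \qquad (39)$$

where a prime referred to the quantities for the second particle, with $m \neq 0$, $m' = 0$. As the equation describing the system at hand, however, Majorana considered that obtained by summing side by side the two equations in (39):

$$\left[\frac{W}{c} + \frac{W'}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} + \boldsymbol{\alpha}'\cdot\mathbf{p}' + \beta\, m c\right]\psi = 0. \qquad (40)$$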



Just as a conjecture, this study could be related to Fermi's theory of beta decay and, in such 
a case, the two fermions should have been an electron and a neutrino. However, it seems that the 
present notes were written by Majorana well before the elaboration (and publication) of Fermi's 
theory, in December 1933. 
As for de Broglie, two independent sets of (only) 4 Dirac matrices appear, here built as follows:

(42)



where $\alpha$, $\beta$ are the Dirac 4x4 matrices in (6), and $I_4$ is the 4x4 identity matrix. Notably, and differently from de Broglie, each set of 16 x 16 Dirac matrices does satisfy the Dirac algebra in (5), while each matrix of one set commutes with any of the other set.
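
The two algebraic properties just stated can be checked in a few lines. What follows is only a sketch: it assumes that the two sets are built as tensor products with the identity, and that the Dirac algebra (5) has the standard form $\alpha_i\alpha_j + \alpha_j\alpha_i = 2\delta_{ij}$, $\alpha_i\beta + \beta\alpha_i = 0$, $\beta^2 = 1$. The symbols $A_i$, $B$, $A'_i$, $B'$ are labels introduced here for illustration, and the ordering of the tensor factors may differ from that of the manuscript.

```latex
% Assumed construction (the labels A_i, B, A'_i, B' are illustrative only)
A_i = \alpha_i \otimes I_4, \quad B = \beta \otimes I_4, \qquad
A'_i = I_4 \otimes \alpha_i, \quad B' = I_4 \otimes \beta .

% Each set inherits the Dirac algebra, because
% (X \otimes I_4)(Y \otimes I_4) = XY \otimes I_4:
A_i A_j + A_j A_i = (\alpha_i\alpha_j + \alpha_j\alpha_i) \otimes I_4
                  = 2\,\delta_{ij}\, I_{16}, \\
A_i B + B A_i = (\alpha_i\beta + \beta\alpha_i) \otimes I_4 = 0, \qquad
B^2 = \beta^2 \otimes I_4 = I_{16} .

% Matrices from different sets commute, because they act on different factors:
A_i A'_j = \alpha_i \otimes \alpha_j = A'_j A_i .
```

The same argument applies with the roles of the two factors exchanged, so the primed set obeys the Dirac algebra as well.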

The energy-momentum relation satisfied by the solutions of this Majorana equation is, of course, not the simple one in (1), but rather a generalization of it to the actual system; it can be written in an easily recognizable form as

(each solution of this is, obviously, fourfold). 



(43) 
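
A relation of this kind follows directly from the commutation properties of the two sets of matrices. As a sketch only, assume that the summed equation has the form $[\mathcal{E} + \boldsymbol{\alpha}\cdot\mathbf{p} + \boldsymbol{\alpha}'\cdot\mathbf{p}' + \beta mc]\psi = 0$, where $\mathcal{E}$ (a symbol introduced here) stands for the energy term multiplying the identity, and that the two sets act on different factors of a tensor product, each satisfying the standard Dirac algebra. The names $M_1$ and $M_2$ are likewise introduced only for this illustration.

```latex
% Notation introduced for this sketch: the two commuting matrix terms
M_1 = \boldsymbol{\alpha}\cdot\mathbf{p} + \beta\, m c, \qquad
M_2 = \boldsymbol{\alpha}'\cdot\mathbf{p}', \qquad [M_1, M_2] = 0 .

% The Dirac algebra of each set gives
M_1^2 = p^2 + m^2 c^2, \qquad M_2^2 = p'^2 ,

% so M_1 and M_2 can be diagonalized together, with eigenvalues
% \pm\sqrt{p^2 + m^2c^2} and \pm|\mathbf{p}'| respectively.
% A non-trivial solution \psi then requires
\mathcal{E} = \pm\sqrt{p^2 + m^2 c^2} \;\pm\; |\mathbf{p}'| ,

% with the two signs chosen independently.
% On its own 4-dimensional factor each sign of M_1 (and of M_2) is twofold,
% so each of the four sign combinations occurs 2 x 2 = 4 times (4 x 4 = 16).
```

Under these assumptions the fourfold degeneracy of each solution, noted in the text, is simply the product of the two twofold spin degeneracies.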



3.2. Equation for a 6-component spinor. As a subsequent step, Majorana then considered a particle described by a 6-component spinor $\psi$ which satisfies a "standard" Dirac equation:

$$\left[\frac{W}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} + \beta\, m c\right]\psi = 0. \qquad (44)$$

It should be noted that, in the original manuscript, the author considered the Dirac equation in the presence of electromagnetic interaction (introduced in the usual way through the 4-potential by means of the minimal coupling principle), which denotes more than a merely mathematical interest; however, when he focussed on the crucial point remarked below, he took the limit of no interaction (which is, indeed, irrelevant for that purpose). For simplicity, as in the previous discussions, we avoid from the beginning the introduction of electromagnetic interaction.



^^It is clear that the matrix $\beta'$ does not appear in the equation above, since $m' = 0$, and then it was not used by Majorana. Nevertheless, we have chosen to report also its expression for completeness; it is deduced from the obvious requirement of completing the Dirac algebra satisfied also by this matrix.

^^To be more precise, he set to zero the vector potential, but maintained a non-zero scalar 
potential. 
In the original manuscript, it is apparent that Majorana first wrote down the equations in (44) for each of the six spinorial components and then deduced the explicit form of the four 6x6 Dirac matrices; they are given as follows:

Very interestingly, this set of matrices does not satisfy the Dirac algebra (5), but rather the DKP algebra in (27), as can be tested quite easily. For the sake of completeness, it should be pointed out that, in the original manuscript, there is no hint that Majorana was aware of this fact, by which he anticipated the findings of Duffin in 1938 and Kemmer in 1939. Nevertheless, it is intriguing that he was, instead, well aware of the Geheniau decomposition of such an algebra or, to be more specific, of the fact that the dimension 6 algebra considered here can be decomposed into a dimension 5 algebra and a trivial one-dimensional one. In order to achieve such a result, Majorana first obtained the energy-momentum relation satisfied by the spinor components, which we write simply as



$$\frac{W^4}{c^4}\left[\frac{W^2}{c^2} - p^2 - m^2 c^2\right] = 0. \qquad (46)$$



This includes, obviously, the usual relativistic relation in (1), but an additional (fourfold) solution with zero energy emerges, just as in the de Broglie theory of 1934. Then, Majorana considered the non-relativistic approximation of his theory at first order, obtaining:

from which he identified the physical components satisfying the ordinary (non-relativistic) energy-momentum relation, thus avoiding the zero-energy solution. Even more importantly, however, he recognized that one spinor component vanishes in the non-relativistic limit, thus obtaining a physical decomposition of the dimension 6 algebra, as opposed to the mathematical decomposition of the DKP algebra.



3.3. 5-component equation. Majorana, however, was not content with the non-relativistic decomposition, and further developed the idea of a particle described by a 5-component spinor, along the same lines as for the 6-component theory. The wave equation is just as in (44) (the remarks above about electromagnetic interaction apply here as well), but with the four 5x5 matrices given by:

Again, such matrices satisfy the DKP algebra (27), and the energy-momentum relation is similar to that in (46), with the obvious replacement of $W^4/c^4$ with $W^3/c^3$ (that is, the zero-energy solution is now threefold).
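
The appearance of zero-energy solutions can be traced directly to the DKP algebra. As a sketch, assume that the wave equation has the Dirac-like form $[W/c + \boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc]\psi = 0$ and that the four matrices $(\alpha_1, \alpha_2, \alpha_3, \beta)$, collectively denoted $\beta_a$ here, obey the DKP relation in the Euclidean-index form $\beta_a\beta_b\beta_c + \beta_c\beta_b\beta_a = \delta_{ab}\beta_c + \delta_{cb}\beta_a$ (the convention actually used in (27) may differ). The symbols $\beta_a$, $n_a$ and $M$ are introduced only for this illustration.

```latex
% Notation introduced for this sketch
n_a = (p_1, p_2, p_3, m c), \qquad
M \equiv \boldsymbol{\alpha}\cdot\mathbf{p} + \beta\, m c = \sum_a n_a\, \beta_a .

% Contract the assumed DKP relation with n_a n_b n_c:
2\, M^3 = 2\,(n \cdot n)\, M
\quad\Longrightarrow\quad
M^3 = (p^2 + m^2 c^2)\, M .

% A solution of the wave equation obeys M\psi = -(W/c)\psi, hence
\frac{W}{c}\left[\frac{W^2}{c^2} - p^2 - m^2 c^2\right]\psi = 0 .
```

Every solution therefore has either $W = 0$ or $W^2 = c^2 p^2 + m^2 c^4$. How many times the zero-energy root occurs is not fixed by this argument; it depends on the dimension of the representation, which is where the 6- and 5-component cases differ.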

3.4. Remarks. The three contributions by Majorana just discussed clearly point out, once more, his mastery and versatility also on the issue of relativistic wave equations. Here, however, it is probably more interesting to dwell a bit on the reasoning behind his calculations.

The starting point was the description of a system of two particles, each obeying a Dirac equation. Such a system, however, was considered as a single entity, described by a single equation involving a 16-component spinor. The reasoning is similar to that of de Broglie in 1934 [21], including the procedure adopted to write down the expressions for the 16 x 16 Dirac matrices: in both cases, they were obtained in a quite obvious way as tensor products of two independent sets of ordinary 4x4 Dirac matrices. The differences in the final results, due to different technical choices, and their interpretation are not very relevant for the present discussion. Quite interesting, instead, is the fact that Majorana turned from his two-particle system described by a 16-component wave function to a one-particle system described by only a 6-component spinor. This is, indeed, a crucial point, since apparently it is justified only by assuming that Majorana did realize the possible decomposition later found by Kemmer [27] and then publicized by Pauli [55] (see above). Such a conjecture seems strengthened by the further, explicit decomposition of the 6-component description.

If this is correct, it is evident that the reasoning by Majorana followed exactly the same basic steps later followed by other people, from de Broglie to Kemmer, thus denoting a uniform development of such ideas. Nevertheless, some distinctive features are present in Majorana's work, since here the transition from 16 to 6 (or 5+1) components meant a transition from the Dirac to the DKP algebra, which is particularly notable if we recall that Majorana obtained it at the level of the wave equation (which he wrote down directly), and not at the level of the abstract matrix algebra (the matrices were deduced by Majorana from his equations). Moreover, quite intriguing as well is the mystery of how Majorana obtained his equations or, in other words, of how he obtained the explicit form of the intervening matrices. In fact, while the 16-component theory seems an obvious generalization of Dirac's one, with the novel matrices again, and obviously, satisfying the Dirac algebra, this is not at all the case for the 6- and 5-component theories, as testified by the fact that the matrices that he deduced did not satisfy the known Dirac algebra. In his notes on these topics, Majorana did not mention any commutation or anticommutation relation related to the abstract algebra employed; rather, his alternative reasoning went through the (more directly physical) energy-momentum relations. Now, it is well known that the requirement of (1) led, in the Dirac scheme, to the Dirac algebra in (5), and this could be invoked to explain the results of Majorana's 16-component theory, which verify the obvious generalization (43) of the energy-momentum relation to the system considered. Instead, a similar reasoning could certainly not apply to Majorana's 6- or 5-component theory, both because the energy-momentum relation in (46) is not at all an obvious generalization of (1), given the presence of additional zero-energy solutions, and because Majorana deduced the energy-momentum relation a posteriori, just by requiring a non-trivial solution to the homogeneous 6- or 5-component wave equations. It is thus unfortunate that no additional information is available on this crucial point.

Finally, it is remarkable that all such general features are present, though in a latent form, in Majorana's paper of 1932 [15], where the infinite-component theory predicted a multi-mass equation (as in the 16-component theory) with only four differently dimensioned matrices, whose form and algebra were not deduced a priori by imposing the energy-momentum relation (1) (as in the 6- and 5-component theories). In this sense, such different works by the same author are closely interconnected with each other, and the three contributions reported above probably served as the starting points for the general theory published in 1932.

4. Pauli and Fierz about Majorana's infinite-component equation 

As discussed above, the relevance of Majorana's paper of 1932 resides not only in the posing (and a possible solution) of the problem of a quantum relativistic equation for particles with arbitrary spin, but also in its curious indirect inspiration of several subsequent works. Now, we will briefly consider the direct influence of that paper on leading scientists or, rather, their careful studies of it, which did not later result in known publications.

Quite unexpectedly, a key role in the understanding of Majorana's paper was played by Pauli (probably informed by Heisenberg), who studied it at least from 1939 to 1947 in two distinct phases, around 1940 and in 1947. This is clearly testified by 9 letters present in Pauli's correspondence, now published in Ref. [67], from which we quote in the following. The reason for such a strong interest is declared in the first letter of this set, which Pauli wrote to Bhabha on 12 April 1940:

I believe in the existence of much more particles than known until 
now, particularly on particles with arbitrary values of the spin and 
of the charge. [...] 

My considerations about the particles with higher spins came to 
some end now. I think that they exist really, but I can fancy that 
the complication of the theory comes from the assumptions, that 
one has to describe a set of particles with a finite number of spin 
values only. May be the matter becomes simpler, if one introduces 
a priori an infinite set of spin values (compare Majorana [...]). 

Such an indication (which evidently urged Bhabha to think about what he later published in 1945) clearly points out that Pauli only started to study Majorana's paper at that time, and further insights into it were still to come. Indeed, a full understanding of the Majorana theory turned out to be not so easy to achieve: "Der Fall unendlich-reihiger Darstellungen der Lorentzgruppe scheint mir jedenfalls noch nicht genügend untersucht. [...] Lesen Sie doch einmal den Majorana!" Or, in Pauli's more characteristic style, when referring to Wigner's paper [26]: "Wigner hat die Majoranaschen Arbeit nicht verstanden, wie er mir zugegeben hat." Pauli himself, however, had to read Majorana's paper several times before reaching any definite conclusion.

^^That is: Pauli to Bhabha, 12 April 1940; Pauli to Fierz, 3 July 1940; Pauli to Fierz, 17 July 1940; Pauli to Fierz, 3 September 1940; Pauli to Jauch, 1 November 1940; Pauli to Fierz, 12 February 1941; Pauli to Fierz, 29 March 1941; Fierz to Pauli, 17 March 1947; Pauli to Fierz, 30 March 1947.

^^That is: "Anyway, it seems to me that the case of the infinite-dimensional representations of the Lorentz group has not been studied sufficiently. [...] Read Majorana!" (Pauli to Fierz, 3 July 1940)

The first problem he envisaged was that of the solutions of the Majorana equations propagating with a superluminal speed (already noted by Majorana himself):

Das Schlimme an seiner Theorie ist, daß es bei ihm ebene Wellen
gibt, die einer imaginären Ruhmasse entsprechen, d.h. Teilchen, die
sich immer mit Überlichtgeschwindigkeit bewegen gemäß
$E/c = \pm\sqrt{p^2 - p_0^2}$ ($p_0 > 0$ beliebig, $|p| > p_0$). [...]

Die Frage wäre: gibt es für unendlich viele Eigenfunktionen [...]
auch solche Gleichungssysteme (bzw. Nebenbedingungen), bei denen
pathologische Lösungen mit $(\nu^2/c^2) - k^2 < 0$ ausgeschlossen
sind? Die letztere Forderung sollte da eine ähnliche komplizierende
Rolle spielen wie bei uns die Forderung positiver Energie bzw.
Ladungsdichte.

It is remarkable that Pauli recognized such a problem as pathological for Majorana's theory (and only this as a real problem: "Ich sehe deshalb auch gar nichts pathologisches in der Majoranaschen Gleichung [...] Das einzige, was mir bei Majorana pathologisch zu sein scheint, sind die Lösungen mit imaginärer Masse."), but nevertheless continued to study that theory for a long time. Indeed, in a long letter to Fierz of 3 September 1940, Pauli re-derived (in an alternative way) the conclusions obtained by Majorana, even casting the original Majorana equations into a different, equivalent form and comparing them with the Dirac equation. The interesting conclusion about the "pathological solutions" was that their exclusion should be related directly to the Dirac-Fierz-Pauli equations: "Es scheint mir, daß bei Ausschluß von unphysikalischen Zuständen mit $p_0^2 - p^2 < 0$ im wesentlichen unsere Gleichungen herauskommen müssen." Such a feeling was likely the background for considering the Majorana equations as an interesting mathematical problem, but without interesting physical applications (Pauli to Fierz, 3 September 1940). Nevertheless, again, this did not prevent Pauli from looking further into the Majorana theory, even with the help of Bargmann, searching for possible alternative representations of the Majorana equations in terms of differential operators. This problem, however, was not so easy to solve, and several failures marked Pauli's research: "Bis jetzt ist es uns nicht gelungen, die Darstellungen mit $0 < J < 1$ durch Differential-operatoren zu realisieren. Wir wissen aber auch nicht, was der Punkt $J = 1$ des Spektrums gruppentheoretisch bedeuten könnte. [...] Das ist also noch ein offenes Problem." However, Pauli did succeed in obtaining several mathematical results about the Majorana theory, and his first investigations of 1940-1 concluded with the important result that:

Die Majoranaschen Gleichungen mit einem $\zeta$-Raum von $\infty$ vielen
Dimensionen geben im ($\zeta$, p)-Raum zu einer reduziblen unitären
Darstellung der inhomogenen Lorentz-Gruppe Anlaß (die alle Fälle
$p_0^2 - p^2 > 0$, $p_0^2 - p^2 = 0$ und $p_0^2 - p^2 < 0$ umfaßt, aber neben den
Diracschen Fällen auch andere).

^^That is: "Wigner did not understand Majorana's paper, as he admitted to me." (Pauli to Fierz, 29 March 1941)

^^That is: "The bad thing with his theory is that it contains plane waves corresponding to an imaginary rest mass, i.e. particles always moving with a velocity greater than that of light, according to $E/c = \pm\sqrt{p^2 - p_0^2}$ ($p_0 > 0$ arbitrary, $|p| > p_0$). [...] The question would then be: are there, for infinitely many eigenfunctions [...], also systems of equations (or constraints) for which the pathological solutions with $(\nu^2/c^2) - k^2 < 0$ are excluded? This last requirement should play a complicating role similar to that played, for us, by the requirement of a positive energy or charge density." (Pauli to Fierz, 3 July 1940)

^^That is: "I don't see anything pathological in the Majorana equation [...]. The only thing which seems to be pathological in Majorana's theory are the solutions with imaginary mass." (Pauli to Fierz, 17 July 1940)

^^That is: "It seems to me that, on excluding the unphysical states with $p_0^2 - p^2 < 0$, essentially our equations must come out." (Pauli to Fierz, 3 September 1940)

^^Bargmann published his final results on the unitary representations of the Lorentz group only years later [68] but, evidently, Pauli was aware of some of his results earlier.

Even though only for a short period of time, Pauli again considered the Majorana equations of 1932 in 1947, probably at the request of Fierz, who, in the meanwhile, had gone further into their mathematical inspection. Now, however, the trouble was with the mass spectrum predicted by Majorana (see Eq. (10)), and re-obtained in a different way by Fierz,

$$m_\ell = \frac{M}{\ell + \frac{1}{2}}, \qquad (49)$$

by using an equivalent form of the Majorana equations. However, "die Gleichungen sind ja auf jeden Fall in dieser Form unbrauchbar, weil gerade die großen $\ell$ zu kleinen Massen gehören. Jede Kopplung, z.B. mit Strahlungsfeldern, wird deshalb Übergänge nach beliebig hohen $\ell$ zur Folge haben." The attention then shifted to physics problems. However, Pauli cut short any possible subsequent discussion, by recalling the result already obtained in 1941:

Es scheint, daß der Gesichtspunkt der unitären Darstellung der
inhomogenen Lorentzgruppe der Klassifikation der relativistischen
Wellengleichungen kräftefreier Teilchen sehr angemessen ist, indem
nämlich den irreduziblen Darstellungen gerade bestimmte Werte der
Masse und des Spins entsprechen. Von diesem Standpunkt aus
erscheinen z. B. die Majoranaschen Gleichungen als ganz willkürlich,
weil reduzibel!

The initial confidence in the existence of "much more particles than known, with 
arbitrary values of the spin" finally changed: the appearance of the lucid analysis 
of Bhabha in 1945 ended the game. 



^^That is: "Up to now we have not been able to obtain representations with < J < 1 using 
differential operators. And we do not even know what it might mean the point J = 1 of the 
spectrum in group theory. [...] So this is still an open problem. (Pauli to Fierz, 12 February 1941) 

^^That is: "The Majorana equations with an infinite-dimensional f-space correspond, in the 
(Ci p)-space, to a reducible unitary representation of the inhomogeneous Lorentz group (including 
all the cases with Pq — > 0, Pq — = and Pq — p^ < 0, besides the Dirac case and other 
cases." (Pauh to Fierz, 29 March 1941) 

^^That is: "In any case, the equations are really useless in this form because, for very large ℓ,
masses are very small. Any coupling, for example with radiation fields, will result in transitions
to arbitrarily large ℓ." (Fierz to Pauli, 17 March 1947)

^^That is: "It seems that the point of view of the unitary representations of the inhomogeneous
Lorentz group is very appropriate for the classification of relativistic wave equations of force-free
particles, namely in that definite values of the mass and the spin correspond to the irreducible
representations. From this point of view, for example, the equations of Majorana appear
completely arbitrary, because they are reducible!" (Pauli to Fierz, 30 March 1947)
5. Conclusions

Contrary to a very common belief among physicists, we have seen that the issue of
the relativistic quantum mechanical description of particles kept theoreticians busy
for a long time. Indeed, although the success of the Dirac equation was established
even before the experimental observation of positrons, owing to its successful
predictions for the hydrogen atom compared with those of the Klein-Gordon equation,
the search for equations describing particles with higher spin lasted from the early
1930s to the mid 1940s, regardless of the actual experimental observation of those
particles.

An important source of inspiration was de Broglie's theory of a composite photon
(described by a 16-component spinor), not so much for its physical content (a
photon composed of an electron-positron pair or, later, of a neutrino-antineutrino
pair), but rather for its mathematical background, which later gave rise, on the
one hand, to the Proca equation for massive spin-1 particles and, on the other hand,
to the Kemmer equation for spin-1 plus spin-0 particles (described by a 10- plus
5-component spinor). This line of reasoning led to the final settlement by Bhabha in
1945, who developed a general first-order equation for particles with arbitrary spin
and studied the conditions under which such a problem could be solved.

The difficult problem of describing particles with spin higher than 1/2 was pre- 
viously (1936) considered by Dirac, and later (1939) improved and refined by Fierz 
and Pauli, who adopted an alternative, more general line of reasoning, involving 
more advanced mathematics. 

While both alternatives are different expressions of the same Lorentz group, crucial
differences appear between the two theories depending on whether one requires
(Dirac-Fierz-Pauli) or not (Bhabha) the a priori validity of the Klein-Gordon equation
for each component of the spinor describing the particle. In the first case, indeed,
subsidiary conditions must be added ad hoc for the theory to be fully consistent,
while this is not needed in the second case. But, even more importantly, while the
Dirac-Fierz-Pauli formalism is able to describe particles with definite mass and
spin, the Bhabha equation is a multi-mass and multi-spin equation.
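
The algebraic side of this difference can be checked numerically in the simplest cases.
The sketch below is only an illustration under stated assumptions: a metric of signature
(+, -, -, -), the Dirac representation of the gamma matrices, and one particular 5x5
choice of Duffin-Kemmer-Petiau matrices (spin-0 sector), used here as a simple stand-in
for a first-order equation whose matrices are not of Dirac type. All function names are
hypothetical. For the Dirac matrices the square of the contraction with the momentum is
p^2 times the identity, so that iterating the first-order equation gives the Klein-Gordon
equation for each component; the Duffin-Kemmer-Petiau matrices obey only a cubic relation.

```python
ETA = [1, -1, -1, -1]  # diagonal of the metric; signature is an assumption


def matmul(a, b):
    """Plain matrix product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, a):
    """Multiply every entry of matrix a by the number c."""
    return [[c * x for x in row] for row in a]


def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def dirac_gammas():
    """Gamma matrices in the Dirac representation (assumed convention)."""
    sigmas = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
    g0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
    gammas = [g0]
    for s in sigmas:
        g = [[0] * 4 for _ in range(4)]
        for a in range(2):
            for b in range(2):
                g[a][b + 2] = s[a][b]    # upper-right block: +sigma
                g[a + 2][b] = -s[a][b]   # lower-left block: -sigma
        gammas.append(g)
    return gammas


def dkp_betas():
    """One 5x5 choice of DKP matrices: component 0 is the scalar,
    components 1..4 play the role of its four-gradient."""
    betas = []
    for mu in range(4):
        b = [[0] * 5 for _ in range(5)]
        b[0][mu + 1] = ETA[mu]
        b[mu + 1][0] = 1
        betas.append(b)
    return betas


def slash(mats, p):
    """Contract matrices M^mu with covariant momentum components p_mu."""
    n = len(mats[0])
    return [[sum(p[mu] * mats[mu][i][j] for mu in range(4)) for j in range(n)]
            for i in range(n)]


p = [3, 1, -1, 2]                                  # arbitrary integer p_mu
p2 = sum(ETA[mu] * p[mu] ** 2 for mu in range(4))  # invariant p^2

gp = slash(dirac_gammas(), p)
bp = slash(dkp_betas(), p)

dirac_squares_to_p2 = matmul(gp, gp) == scale(p2, identity(4))
dkp_squares_to_p2 = matmul(bp, bp) == scale(p2, identity(5))
dkp_cubic_identity = matmul(matmul(bp, bp), bp) == scale(p2, bp)

print(dirac_squares_to_p2, dkp_squares_to_p2, dkp_cubic_identity)
```

Integer momentum components are used so that all comparisons are exact; the particular
values carry no physical meaning.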

The issue of a general equation for arbitrary spin was, however, posed by Majorana
early in 1932, that is, just when only the Klein-Gordon and the Dirac equations
were known as relativistic particle equations. His solution was in terms of an
infinite-component theory based on a multi-mass and multi-spin equation, but this
cannot be considered simply as a precursor (though a generalization, with an infinite
number of spinor components) of the later Bhabha equation. Indeed, not only are the
general ideas underlying the Majorana and the Bhabha equations the same but,
quite interestingly, the evolution of the specific line of reasoning is itself identical.

As we have shown above by making recourse to unpublished documents, Majorana
too developed a 16-component theory describing a system of two Dirac fermions,
later developed in a less detailed and more involved way by de Broglie, as well as
a 6-component and a 5-component theory for a one-particle system, based on the
later discovered Duffin-Kemmer-Petiau algebra and its decomposition. Thus, the
difference in the specific line of reasoning of Majorana and Bhabha was only in the
period of time required for its evolution: probably a few weeks in one case
(Majorana), and more than ten years in the other (Bhabha), with the contribution
of several authors. It is very interesting that the same line of reasoning
evolved in an identical manner, regardless of the actors involved: in the history
of physics there are not many similar cases that can be studied. Although
Bhabha was introduced (in 1940) by Pauli to Majorana's paper of 1932, the
"intermediate theories" discussed above were not included at all in that paper (or
in any other published one), and only a latent and very indirect influence (if any)
may have reached Bhabha.

On the other hand, Majorana's paper of 1932 proved very difficult
to understand fully (probably precisely because of its pregnant meaning and latent
physical and mathematical content), even for first-rate theoreticians like Wigner
and Pauli. In particular, Pauli soon enough (in 1940) recognized that, contrary to
naive expectations, the Majorana approach with an infinite rather than finite number
of components greatly simplified the matter, even with respect to his own (with Fierz
and Dirac) theory. However, it took about one year of intense study with Fierz
for him to work out his understanding of Majorana's theory in detail, including the
recognition that Majorana's equations corresponded to a reducible unitary
representation of the Lorentz group (as in the Kemmer case, for example), as witnessed
by Pauli's correspondence. And, quite intriguingly, only in 1947 (that is, after the
appearance of Bhabha's, Wigner's and Bargmann's papers) did Pauli finally declare his
preference for equations with definite values of the mass and the spin. This testifies
to the difficulty of the problem at hand and to the depth of Majorana's reasoning and
results.

The saga of the relativistic quantum mechanical description of particles with
arbitrary spin came to an end with the final mathematical systematization by
Bargmann and Wigner in 1948 along the lines of Dirac-Fierz-Pauli (that is,
equations implementing irreducible rather than reducible representations of the
Lorentz group), although mathematical improvements and generalizations of the
previous theories have continued to appear up to the present day. The end of the
story was justified by the recognition that relativistic wave mechanics derived its
physical relevance just from its incorporation into quantum field theory, so that
subsequent discussions mainly focused on this last framework.

Nevertheless, contrary to naive expectations, what has been reviewed here is of
enormous potential interest for present-day research, for example in condensed
matter physics. Indeed, leaving aside the standard case of simple
superconductors, where Cooper pairs are described by a scalar field [69] that, in the
Ginzburg-Landau theory, just follows the Klein-Gordon equation, in several
materials the charge carriers behave exactly as Dirac fermions [70], and a number of
key phenomena are predicted by the Dirac equation (4) applied to these
particles [71]. Moreover, several other investigations suggest that exotic quasiparticle
excitations in a variety of interesting condensed matter systems follow, instead, the
Majorana equation (25), that is, they are fermionic excitations that are their own
antiparticles (Majorana fermions) [75]. Other exotic phenomena (such as, for
example, that considered in [73]) exist that, in principle, could be described by other
equations (such as, for example, the Kemmer equation (28) for describing electrons
in these exotic materials grouped in s-wave or p-wave pairs).

The subject is, then, worth exploring further in current science, with
possible novel interesting results to come: to paraphrase Pauli, it "has not been
studied sufficiently." Yet.